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PREFACE 


This book is based on a course presented at the Lewis Research Center. 
The course was given to engineers and scientists who were interested in in- 
creasing their knowledge of differential equations. Those results which can 
actually be used to solve equations are therefore emphasized; and detailed 
proofs of theorems are, for the most part, omitted. However, the conclusions 
of the theorems are stated in a precise manner, and enough references are 
given so that the interested reader can find the steps of the proofs. 

In spite of the fact that differential equations is a branch of mathematics 
which is to some extent a collection of loosely related ideas, we have attempted 
to impart as much unity to the subject as possible and to point out the connec- 
tion between the various topics. Although the material is not new, some parts 
of the presentation are unconventional. Certain modern ideas, such as the 
theory of matched asymptotic expansions, which have not worked their way 
into most conventional texts, are also included. In the chapter on numerical 
methods, we have attempted to include a discussion of the considerations 
which should be taken into account when applying the methods to various 
problems which arise frequently in practice. Techniques for solving first-order 
partial differential equations are discussed in the chapter on systems of ordi- 
nary differential equations since it is felt that these topics are easier to under- 
stand when they are presented simultaneously. 
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CHAPTER 1 



In this chapter we shall introduce some preliminary concepts and try to 
gain a certain amount of insight into the behavior of differential equations and 
their solutions. Such questions as what is meant by the solution of a differential 
equation and what is the appropriate number of solutions will be considered 
with some care. 

Leibniz, in 1676, was probably the first to introduce the term “differential 
equation.” However, the study of differential equations had its beginnings 
somewhat before this time in investigations of physical phenomena. Ever since, 
developments in the field of differential equations have been closely related to 
the physical sciences. 

1.1 SOLUTION OF EQUATIONS IN GENERAL 

In this section we introduce some concepts which are needed for the 
subsequent presentation. First, in order to obtain a geometric interpretation 
of certain results, we shall have occasion to interpret a set of n variables, say 
Xi, . . ., x n , as coordinate axes in an n-dimensional space. A particular set 
of values, say x%, . . . , x%, of the n variables are then the coordinates of a 
point in this space. When n = 2 and n = 3, these spaces are the very familiar 
two- and three-dimensional Euclidean spaces. Hence, as will be seen, this 
procedure allows us to use our geometric intuition about two- and three- 
dimensional spaces to “picture” results about functions and equations involving 
any number of variables. 

We shall not need to use many of the mathematical properties of the 
n-dimensional spaces; however, it will be important to have a careful defini- 
tion of certain special regions in these spaces. Thus, let x*}, . . ., x° be the 
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coordinates 1 of a fixed point in an n-dimensional space, and consider the 
collection of all points whose coordinates satisfy the inequalities 

\x\ — x?| < 8, . . \x n — x%\ < 8 

for some positive number 8. When n = 2, these points form the interior of a 
square with center at (xu £ 2 ) and sides of length 28. When n = 3, the points 
form the interior of a cube with center at (*?, x\, x%) and sides of length 28. 
The reason for using the word “interior” is that the boundary points which 
satisfy the equalities 

\xi~ x?| = 8, . . |x„ — *2| = 8 

have been omitted from the collection. 

Next we define a domain D to be any region of the n-dimensional space 
which satisfies the following two conditions: (1) there exists a positive number 
8 for each point belonging to D such that every point Xn 

lying in the range 


|a:i — jc?| < 8, . . — < 8 

is also a point of D ; and (2) any two points in this region can be connected by a 
continuous line. Thus, in a two-dimensional space the first condition requires 
that we be able to draw around each point of D a square whose interior lies 
entirely within D. This is illustrated in figure 1-1. The only points where this 
is not possible are the points which lie on the boundary T of D. This is because 
every square about a point of T must include at least some points which do not 
belong to D. Thus, the first condition serves to exclude the boundary points 
from D. 

The second condition simply requires that the region not be composed of 
two or more disconnected subregions, as shown for two dimensions in figure 
1 - 2 . 

A neighborhood of the point *?, . . ., x% is simply a domain which contains 
this point. Evidently, the collection of all points which satisfy the inequalities 
in condition 1 form a neighborhood of the point . . ., x° n . 


1 Instead of saying that x*[, . . x° n are the coordinates of the point, we shall frequently say that the crffatfion 
x° n is the point. 
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Figure 1-1. — Interior and boundary points for a domain D. 



Figure 1—2. — Disconnected regions which fail to satisfy condition 2 for a domain. 


A single-valued function in the x, y-plane can be roughly thought of as 
one or more curves which associate a single value y with each value x. For 
example, the function y=x 2 / 2 shown in figure 1-3 associates the single number 
*8/2 with each point *o. On the other hand, a multivalued function associates 
more than one value of y with each value of *. For example, the function 
tan -1 *, which is shown in figure 1—4, associates infinitely many values of y 
with each value of *. 

In order to avoid ambiguity we shall always suppose that every function 
which is encountered is single valued unless explicitly stated otherwise. 

Consider the equation 


F(x,y) = 0 


(1-D 
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where F has continuous first partial derivatives with respect to its arguments. 
We say that the function y = /(*) is a solution of this equation if 

F[x,f(x)]= 0 

for all values of * for which fix) is defined. The solutions y=f(x) of equation 
(1-1) are said to be determined implicitly by this equation and are called 
implicit functions. 

In fact, the implicit function theorem states that for every point ( x 0 , yo) 
such that 


F(* 0 ,yo) = 0 


and 


— (* o ,yo)^0 (1_2) 

there is a unique, continuous solution y—f(x) of equation (1-1) which is de- 
fined on some neighborhood of the point *o, which satisfies the condition 
yo—f(x o) and which has a continuous first derivative in this neighborhood. 

Now if F(x, y) is a reasonable function, equation (1-1) will be the equation 
of a curve in the*, y-plane. For example, the implicit relation between* andy 

F(x,y)=x — y 3 = 0 (1-3) 

is the equation of the cubical parabola shown in figure 1~5. It has an explicit 
solution given by 2 

y = /(*) = (sgn*)|*| 1/3 (1-4) 

for all values of*. 


2 The function sgn* is defined by 

f +1 for * s* 0 
sgn x= | 

l — 1 for x < 0 
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Figure 1-5. — Cubical parabola x = y 3 . 


Differentiating this equation with respect to x shows that 


dy 

dx 


-fix) 


1 

3|*| 2/3 


Hence, condition (1-2) is satisfied at any point A on the curve in figure 1-2 
which does not coincide with the origin. And, as required by the theorem, 
equation (1-4) provides a unique solution with a continuous first derivative 
in some neighborhood of this point. However, condition (1-2) does not hold 
at the origin. But then the solution (1-4) has an infinite derivative (which is 
certainly discontinuous) at this point. 

More generally, consider the equation 


F(X i,X 2 , . . .,ACn,y)=0 


(1-5) 


A solution to this equation is a function 


y=f(x i,* 2 , . . . ,x „ ) 
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with the property that 

F\_X i, X 2 , . . •) ^2) • • • * Xn ) J 0 

for all values of Xi, Xi, . . . , x n for which /is defined. In order to obtain a 
geometric picture of equation (1-5) and its solution, it is convenient to think 
of the variables x u * 2 , . • x n , y as the coordinates of a point in an (n + 1)- 
dimensional space. 

Now suppose that the function F has first partial derivatives with respect 
to its arguments in some domain D of this space. The implicit function theorem 
now shows that for every point (x? , x®, . . .,Xn,y°)ofD such that 

F(x%,x§, . . .,jcS,/ ) )=0 

— (*?,*§, . . .,x^y°) #0 

there is a unique continuous solution 

y=f(x u x 2 , . . . ,x n ) 

of equation (1—5) which is defined in some neighborhood of the point x®, 
x$, . . ., satisfies the condition y°=f(x^, x%, . . ., x £), and, has continuous 
first partial derivatives in this neighborhood. 


1.2 DEFINITION OF DIFFERENTIAL EQUATION 

A differential equation is an equation connecting the values of a function, 
called the dependent variable, the derivatives of this function, and certain 
known quantities. If the dependent variable is a function of a single variable 
(independent) variables, the differential equation is called a partial differential 
differential equation. If the dependent variable is a function of two or more 
(independent) variables, the differential equation is called a partial differential 
equation. 

Thus, an ordinary differential equation is an equation of the form 


F ( XY *y ^ 1=0 

*\?' y 'dx' ’ ' "dx*l 


(1-6) 
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where the positive integer n, the order of the highest derivative which appears 
in equation (1-6), is called the order of the equation and F is a function of 
the indicated n + 2 quantities. 

If the function F in equation (1-6) is a polynomial of degree m in the 
highest order derivative d n yldx n , we say that it is a differential equation of 
degree m. Thus, the first-order differential equation of degree m has the form 3 4 

m 

F(x,y,y') =M 0 (x,y) (y') m + ^ M k (x,y)(y') m ~ k = 0 

The differential equation (1-6) is said to be linear if the dependent vari- 
able and all its derivatives appear only to the first degree. Thus, the general 
linear equation can be written in the form 

' - + a "-i ( x )^fa + a n{x)y+b{x)=Q (1-7) 

When the function b(x) in equation (1-7) is identically zero, the equation is 
said to be homogeneous. 

1.3 SOLUTIONS AND INTEGRALS OF DIFFERENTIAL EQUATIONS 

A ( particular ) solution 4 of the differential equation 5 (1-6) is any n- 
times differentiable function f(x) defined on some interval a<x<b (which 
may be infinite) such that equation (1-6) becomes an identity when y and its 
derivatives are replaced by f(x) and its derivatives. Hence, 



for all x in the interval a < x < b. Thus, the function y=sin a: is a solution 
of the differential equation y" +y— 0. 


3 We shall frequently write /' or /'(*) for df(x)(dx J f ff or f" {x) for d z ffdx 2 , . . ., and/<"> or /<">(*) in place 
of d n fldx n for n= 1,2, . . . 

4 The term “solution” was first used by Lagrange (1774). 

5 There are also solutions known as weak solutions which satisfy the differential eq. (1-6) only in a certain average 
sense. We shall not pursue this topic further here. The interested reader is referred to ref. 1 for an elementary treat- 
ment and to ref. 2 for a more advanced discussion. 
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Now consider the equation 

F(x,y, c,,c 2 , . . ., c„) =0 (1-8) 

in which x and y are variables, and c u c 2 , . . ., c„ are independent 6 arbitrary 
constants. 7 We shall suppose that equation (1-8) possesses solutions for at 
least certain values of the constants Ci, C2, . . .,c„. 

Upon differentiating equation (1-8) with respect to x, n times in succes- 
sion, we obtain 


dF dF , 
dx dy y 


= 0 


d 2 F d^F_ , d 2 F 
dx 2 + Z dxdy y + df 


(?'r+ d £y"=o 


d^F 

dx n 


+ . . . 


+T- y (n) =0 
dy J 


If the n constants can be eliminated between these n equations and equation 
(1-8), we can, upon carrying out this elimination, obtain a differential equation 8 


G[x,y,y',y", . . .,y< n) ]=0 (1-9) 

It is clear, from the manner in which equation (1-9) was obtained, that 
every solution of equation (1-8) satisfies 9 equation (1-9). The function F 
is therefore sometimes referred to as a primitive of equation (1-9). In many 
instances the solutions of equation (1-8) for y as a function of x can be ex- 
pressed as formulas of the form 


6 This means that eq. (1-8) is not expressible in terms of fewer than n constants. For example, y 2 — x 2 + c^ — c 2 
depends effectively only on the single constant c = c\ — c 2 . 

7 We assume that all functions are differentiable as many times as is necessary for the argument and that dFjdy 
is not identically zero. 

8 In practice it usually will not be possible to carry out all the algebraic operations which are necessary to obtain 
an explicit formula for eq. ( 1-9) . 

9 However, as we shall see in the following example, the converse of this statement is not necessarily true. 
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y—f(x, Ci, c 2 , . . . * On) (1-10) 

which are parameterized by the n constants Ci, c 2 , . . c„. We therefore 

anticipate that, for many nth-order differential equations, there exist formulas 
of the form (1-10) each of which contains n independent constants Ci, . . ., c„ 
and provides a solution to its corresponding equation for every set of values 
of these n constants for which the formula makes sense. 10 Such a formula, 
if it exists, is said to be a general solution of the equation. 

This terminology should not be interpreted to mean, however, that every 
differential equation possesses a single general solution from which every 
solution to the equation can be obtained by suitably choosing the values of 
the constants. This is only true when certain restrictions are imposed on the 
differential equation (1-6). 

For example, consider the equation 11 

F(x, y, c) = c 2 ~ n [x~ c (n-1 >] 2 + y n — 1 = 0 for n=l or n = 2 (1-11) 

where c is an arbitrary parameter which can take on any real value. By differ- 
entiating this equation with respect to x we find that 


F x + y'F v — 2c n ~ I [x: — = 0 


( 1 - 12 ) 


First, consider the case where n= 1. Then upon eliminating c between 
equations (1-11) and (1-12) we obtain the first-order linear equation 


/+ m zii =0 

y x—1 


(1-13) 


Now, for each value of c, equation (1-11) possesses a single solution 

y=\— c{x — l) 2 for — <»<*<oo (1-14) 

This must, therefore, be a general solution to the differential equation (1-13). 


10 It is usually necessary to restrict the range of the c’s and of x in order to avoid imaginary expressions and other 
degeneracies. 

"When n— 1, the equation represents a family of parabolas with vertices at the point x— 1, y= 1; and when 
n — 2, it represents a family of unit circles with centers on the x-axis. 
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And it happens that every solution to equation (1-13) can be obtained from 
equation ( 1-14) by a suitable choice of the constant c. 

Next, consider the case where n = 2. Then upon eliminating c between 
equations (1-11) and (1-12) , we obtain the differential equation 


yHy ,2 +l) — 1 = 0 


(1-15) 


For each value of c, equation (1-11) now possesses the two solutions 12 


y= VI — (* — c) 2 ) 
y= — Vl — (x — c) 2 f 


for \x — c\ ^ 1 


(1-16) 


Hence, both of these formulas are general solutions to equation (1-15). Thus, 
there is no single formula involving only a single parameter from which every 
solution can be obtained. 13 This is not surprising since equation (1-15) can 
be written as 


i ( f ) 2+ ^- 1=0 (1 - 17 > 

and since only y 2 appears in this equation, it is clear that if y = f(x ) is a solution, 
so isy=— /(*;). 

Even though equations (1-16) are the only general solutions to equation 
(1-15), it is still not possible to obtain every solution to the differential equa- 
tion from these two formulas. Thus, equation (1-15) also possesses the two 
solutions 


y=+ 1 

y=— l 

which not only cannot be obtained from either of equations (1-16) but do not 
even satisfy the primitive equation (1-11). They are called singular solutions 
of equation (1-15) (see section 1.5) and are tangent to every solution obtained 
from the general solutions. 


12 For any positive real variable x , Vx will always denote the positive square root of x . 

13 Notice that we can combine the two eqs. (1-16) into a single formula y — (—1)" VI — (x — c) 2 for n = 0, 1 , 
but this solution involves the two parameters n and c. 
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We have shown how an nth-order differential equation can be obtained 
by successively differentiating an equation of the type (1-8). In fact if, for a 
given differential equation, we can find an implicit relation of the type (1-8) 
such that every solution of the differential equation is also a solution of this 
equation, we can for practical purposes consider the differential equation 
solved. Thus, the process of finding a solution to a differential equation might 
be thought of as reversing the process of obtaining a differential equation 
from its primitive. The first step of this inverse process when applied to the 
nth-order differential equation (1-6) (if it can be carried out) leads to a differ- 
ential equation of order n — 1 which involves a single arbitrary constant c. 
And if this equation can be solved for c, it can be written in the form 

H(x, y, y' , . . .,?<»-•>) =c (1-18) 

More generally, if for any given function H every solution of equation (1-6) 
satisfies an equation of the form (1-18) for some value of c, this function 
(and sometimes eq. (1—18) itself) is called a (first ) integral of equation 14 (1 _ 6). 
Geometrically, this means that an integral H is constant along every solution 
curve y—f (sc) of the differential equation. 

1.4 SOLUTIONS TO NORMAL EQUATIONS 

If we try to find a solution to equation (1-15), say 

y=f(x ) 

which satisfies the initial condition /(1)=0, we find from the first equation 
(1-16) that 


y=f(x) = VI— x? 


and from the second equation (1-16) that 

y=f(x) =— Vl — sc 2 


,4 Since integration is the inverse of differentiation, the preceding remarks show why the process of solving a 
differential equation is sometimes referred to as integrating the equation. 
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Therefore, equation (1-15) has two solutions which satisfy the condition 
y= 0 at x = l. However, we usually expect the solution of any first-order 
differential equation arising from a physical problem to be uniquely determined 
by a single initial condition. It is, therefore, necessary to find what conditions 
must be imposed on a given differential equation if our expectations are to be 
justified. To this end suppose that equation (1-6) can be solved for its highest 
order derivative. It can then be written as an equation of the form 

y<”> = G(*,y,y', . . .,y<"-») (1-19) 

Any ordinary differential equation which has the form of equation (1 19) is 
called a normal differential equation or is said to be in normal form. 

We are interested in finding conditions which ensure that equations of 
this type possess unique solutions satisfying the initial conditions 

x = xo,y=Yo,y' = Y l , . . .,y<"- 1 )=T„- 1 (1-20) 

where To, . . Y n -i are constants. The fundamental theorem for nth-order 
differential equations states that this is always the case, at least locally, 
provided that certain restrictions are imposed on the function G. In order to 
state this theorem in a precise way, let us for the moment treat the variables 
x, y, y', . . ., y (n_1) as independent. Then if there exists a positive number 8 
such that the functions 


r dG d^G d n ~ l G 
’dy’Sy 2 ’ • ' ' ’dy n - 1 

are defined and continuous 15 for all values of the variables x, 
y<»- 0 which lie in the range 


( 1 - 21 ) 


y* y'i 


• 9 


x— jc 0 | < 8, |y— yo| < 6, \y' — Fi| < 8, . . .,|y (n 1)_ F„_i|<8 


the fundamental theorem states that equation (1—19) possesses a solution 
y—f(x) which satisfies the initial conditions (1—20) and is defined on some 


15 Notice that only the partial derivatives of G with respect to the variables y, y\ . . . , y* 10 need be continuous, 
but the partial derivative with respect to x need not even exist. 
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interval containing the point x = xo, say 

a< x < b (1-22) 

In addition, this solution is unique. This means that if g(x) is any other solution 
to equation (1—19) which satisfies the initial conditions (1-20), then 

f(x) = g(x) 


for all x in the interval 16 (1-22). 

Notice that we have only asserted that the interval (1—22) exists; we have 
not given any idea of its size. 17 That is, there is no relation given between a, b, 
and 8. The interval might be exceedingly small. However, we shall give a 
measure of the size of this interval for an important special case. Thus, if the 
partied derivatives (1—21) are defined and continuous for all values of y, y', 
. . ., y (n-1) and for all values of x in some interval which contains the point 
x=xo, say 


a < x < (3 (1-23) 

and if there exists a single constant K such that 

for i = 0, 1, 2, . . .,n- 1 

for all y, . . ., y n-1) and all x in the interval (1-23), the solution y=f(x) 
which satisfies the initial conditions (1-20) exists and satisfies the differential 
equation for all x on the interval (1-23). 

For example, it is easy to verify that the equation 

y' = G (y ) = 1 + y 2 


has the solution y=tan (x;+c). Since tan x becomes infinite at x— ± tt/ 2, it 


18 Actually, the same conclusions can be reached even when somewhat weaker conditions are imposed on the func- 
tion G in eq. (1—19). Proofs of the fundamental theorem can be found in refs. 3 to 5. 

17 In fact, a measure of the size of this interval can be given in terms of certain bounds on the function G. See, for 
example, ref. 4, theorem 8, p. 118. 
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is clear that the solutions to this differential equation are defined only on inter- 
vals whose lengths are at most tt, even though G and dG/dy are defined and 
continuous everywhere. This occurs because there is no constant K which is 
larger than dG/dy = 2 y for all values of y. 

Although we normally expect the solution of an nth-order differential 
equation to be uniquely determined by specifying n initial conditions (or the 
equivalent), we have seen by example that specifying these conditions is not 
necessarily sufficient to uniquely determine the solution to every nth-order 
equation. The preceding discussion indicates that the extraneous solutions 
must arise either because the differential equation (l - 6) is not in normal 
form (i.e., solved in a unique way for the highest order derivative) or because, 
even if it is in the normal form (1-19), the function G is not sufficiently smooth 
at some points. 

A general solution to equation (1-19), if it exists, contains n “arbitrary” 
constants. And since, in principle, it is usually possible to solve n equations 
in n unknowns, this is consistent with the fundamental theorem which states 
that n initial conditions determine the solution to equation (1-19). 

For example, integrating the differential equation 

y"=* 2 (1-24) 


twice yields the general solution 


y=— +c,^+c 2 


(1-25) 


containing two arbitrary constants c\ and c 2 . The constants are uniquely 
determined by the initial conditions 


x = xo,y=Yo,y' = Yi 


since the two equations 


Fo =: y2 + ci*o+c 2 


Y t = 
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can be solved for c i and c 2 to obtain 


c,=r,- 


y3 

±o 

3 


c a = Yo-xoY 1 +^ 


More generally, suppose that by some formal process of integration we 
have found m general solutions 


y=fi(x,c u C 2 , . . ,,c n ) i=l,2, . . .,m ( 1—26 ) 

of the differential equation (1-19) and that every solution of this differential 
equation can be obtained from these formulas by properly choosing the con- 
stants. Then for every set of values of the n constants xo, Yo, Yt, . . .,Y n - 1 
for which G satisfies the conditions imposed in the fundamental theorem, 
there must be one and only one value of i for which it is possible to solve the 
n equations 


Yo — fi{xo, Ci, C 2 , . . -jCn) 


Yi=-jH x 0 , Cl, c 2 , 

ax 


• * Cn) 


Yn-t 


d'-'fi 

dx n ~ l 


(xo. Cl, C 2 , • • • , C yj ) 


for the n constants Ci, c 2 , . . .,c„. 

For example, the function 

y=f(x,c) = Vl + ce x (1-27) 

satisfies the differential equation 
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(1-28) 


for all values of c for which the radical exists. And for this equation the partial 
derivatives (1 — 21) are always continuous except when y= 0. Hence, the funda- 
mental theorem shows that there must be a solution which satisfies the initial 
condition 


x=0 y——\ 


However, there is no possible choice of c in equation (1—27) for which /( 0, 
c) = — 1, which shows that it is not possible to obtain every solution to equa- 
tion (1-28) from the formula (1-27). However, the function 

y= — Vl + ce x (1 — 29) 

also satisfies equation (1-28) for all values of c for which the radical exists. 
Equations (1-27) and (1-29) taken together will now provide a unique solution 
for each initial condition which does not involve y = 0. But these formulas will 
provide two solutions satisfying each initial condition involving y=0, where 
the partial derivatives (1-21) are not continuous. However, the fundamental 
theorem provides no information about solutions which pass through points 
where the partial derivatives (1-21) are not continuous. 

Specifying initial conditions is only one of many ways of determining the 
values of the arbitrary constants which appear in the general solutions of a 
differential equation. The most common alternative is to require that the 
solution and its derivatives satisfy certain conditions at both ends of some 
interval, say xi x =£ X 2 , within which this solution is being sought. These 
conditions are called boundary conditions . 18 

For example, we might require that the solution to equation (1-24) satisfy 
the boundary conditions 


18 The term “initial conditions” arose in conjunction with problems in mechanics which have time as the independ- 
ent variable and have conditions imposed at some initial time. The term “boundary conditions” arose in conjunction 
with problems involving physical distance as the independent variable with conditions imposed at the boundaries of a 
physical region. 
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y=yi atx = xi 

y— y 2 atx = %2 

Then substituting equation (1-25) into these conditions shows that 

yi — j2+cixi+c 2 

4 

y2 = j^+c 1 x 2 + c 2 

And since these equations have the unique solution 

yi — y2 | ~ x i 

Cl X 1 — X 2 \2(xi~ x 2 ) 

^ ^271—^172 . *1*2(*| — *?) 

C2 *2 — Xi 12 ( x 2 — Xi ) 

we conclude that the two boundary conditions uniquely determine the solution 
of equation (1—24). There are a number of theorems which give conditions 
which, if satisfied, will ensure that the solution of an nth-order equation will 
be uniquely determined by n boundary conditions. However, since these con- 
ditions are fairly complicated, we will not state any of these theorems herein. 

1.5 SOLUTIONS TO EQUATIONS NOT IN NORMAL FORM -SINGULAR SOLUTIONS 

We have shown that the nth-order normal differential equation possesses 
a solution which is uniquely determined by n initial conditions, provided that the 
conditions imposed in the fundamental theorem are satisfied. In order to use 
this result to obtain information about the solutions of a differential equation 
which is not in normal form, it is necessary to solve this equation, at least in 
principle, for its highest order derivative. 

First, consider the general nth-order equation of the first degree 

U(x,y, . . .,y»-«)y<»)-f F(*,y, . . .,y < "- 1) ) = 0 (1-30) 

This equation can be written in the normal form 
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y (n) =G(*,y, . . .,y (n_1) ) = — 


Y(x,y, ■ . 

U(x,y, . . .,y (n ~ 1) ) 


(1-31) 


for all values of x, y, . . . , y (n_1) for which U is not equal to zero. We shall 
suppose that U and V are defined and possess continuous first partial deriva- 
tives for all values of x, y, . . ., y (n_1) . Then the function G will possess con- 
tinuous first partial derivatives for all values of x, y, . . y (n-1) at which 

U t^O. Hence, the fundamental theorem shows that equation (1-30) will possess 
a unique solution satisfying any set of n initial conditions x = xo, y — yo, • • •, 
y(n-t)=Y n -i for which U(x o, Fo, . . Y„- 1 ) ¥= 0 . Each solution of equation 

(1-31) will satisfy equation (1—30); and if U is never equal to zero, every solu- 
tion of equation (1~30) will satisfy equation (1-31). Hence, in this case, equa- 
tions (1-30) and (1-31) are equivalent. However, suppose that the function U 
vanishes for certain values of its arguments. If equation (1-30) possesses a 
solution y— f(x) such that for certain values 19 of x 


U(x,f(x),...,f*-»(x)) = 0 (1-32) 


then y=f(x ) will not necessarily satisfy equation (1—31) for these values 20 
of x. It is called a singular solution of equation (1-30). 

For example, the first-order equation 

yy'_(y2_y)=0 ( 1~33 ) 

can be written in the normal form 

y ,=rLzy =y - x (1-34) 

y 

The normal equation (1-34) has the general solution 

y— 1 + ce x 


19 Thus y=f(x) satisfies both eq. (1-30) and the (n— l)st-order differential equation U(x, y, . . y^ n 1) ) == 0. 
20 Even if the numerator of G vanishes in such a way that it is possible to define G(x,f(x ) , . . . , f {n ~ l) (x ) ) at these 
values of x. 
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and this is also a general solution of equation (1-33). However, equation 
(1-33) also has the solution 


y= 0 


(1-35) 


but equation (1-34) does not. Since the coefficient of y vanishes at y= 0, 
equation (1-35) is a singular solution of equation (1-33). 

Now consider the general equation (1-6) and let us temporarily treat 
the quantities x, y, . . . , y n) as independent variables. Suppose that F has 
continuous first partial derivatives with respect to its arguments. Then the 
implicit function theorem (given in section 1.1) shows that equation (1-6) 
has several (possibly infinitely many) solutions of the form 

y n) = G(x,y , . . .,y-») ( 1 - 36 ) 

where the function G is continuous and has continuous first partial derivatives 
for all values of its arguments for which it is defined. One of these solutions 
will satisfy equation (1-6) for each set of values of x, y, . . ., y M for which 
equation (1-6) holds and 


y n) )^o (1-37) 

However, equation (1-36) is a normal differential equation; and therefore the 
fundamental theorem shows that it possesses a unique solution satisfying 
any set of initial conditions x = xo, y = To, • • ., y (n-1) = Y n - 1 for which G(x o, 
y 0 , . . . , Yn-i) is defined. All the solutions of the normal equations of the form 
(1-36) obtained from equation (1-6) will satisfy equation (1-6). If dF/d^ n) 
is never equal to zero for any values of x, y, . . ., y (n) for which equation 
(1-6) holds, every solution of equation (1-6) will satisfy one of the equations 
(1-36). Hence, in this case, equation (1-6) is equivalent to the set of normal 
equations of the form (1-36). However, suppose that 

^,(x,y,y' ,y«>)=0 (1-38) 


for certain values of x, y, . . ., y n) . If equation (1-6) possesses a solution 
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y=f (#) such that for certain values 21 of# 

(#,/(#), . . .,/ ( '* ) (#)) = 0 (1-39) 

then for these values of # the implicit function theorem no longer guarantees 
that there will be normal equations (satisfying the conditions imposed in the 
fundamental theorem) which are satisfied by the solution y=/(#) of equation 
(1—6). This solution is, therefore, called a singular solution of equation (1 — 6). 
For example, the equation 

F(y,y') = y' 2 +l-y=0 (1-40) 

has the two solutions 

y' = Gi(y ) = Vy— 1 (1-41) 

y=G 2 (y) = — Vy— 1 (1-42) 

where G\ and G 2 have continuous derivatives for all values y, y' for which equa- 
tion (1-40) holds and for which 



(1-43) 

Now equation (1-41) has the solution 


y= 1 + -^- (# — c) 2 for# S 5 c 

( 1 — 44) 

and equation (1-42) has the solution 


y= 1 + -^- (# — c) 2 for#<c 

(1-45) 


21 Thusy=/U) satisfies both the differential eq. (1-6) and the differential eq. (1-38). 
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Figure 1-6. —Solutions of equations (1-41) and (1-42). 


For each value of c, these two solutions are the two branches of the same parab- 
ola. They are shown in figure 1-6. The solutions of equation (1—41) are shown 
as solid curves. The solutions of equation (1-42) are shown dashed. Notice 
that one solution to equation (1-41) and one solution to equation (1-42) 
pass through each point (such as point A) lying above the line y= 1, which is 
consistent with the fundamental theorem. 

Equation (1-43) shows that the points where y' = 0 are exceptional. 
But, for those values of y and y which satisfy equation (1-40), this can occur 
only when y=l. However, when y=l, dGi/dy and dGddy become infinite; 
and therefore the fundamental theorem does not apply along this line. Equa- 
tions (1—41) and (1—42) (and, therefore, also eq. (1—40)) possess the solution 


y=l 


which lies along this line and which cannot be obtained for any choice of c 
from either equation (1-44) or (1-45). Thus, y— 1 is a singular solution. 
Notice that there are three solutions of equation (1-40) passing through each 
point on the line y— 1. 

Now consider the first-order differential equation 

F{x,y,y') = 0 (1-46) 

The singular solutions of this equation, if they exist, must also satisfy the 
equation 
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^7(*,y,y')=0 (1-47) 

When y' can be eliminated between equations (1-40) and (1-41), we obtain 
the equation 


g(x,y)= 0 (1-48) 

Since the singular solutions simultaneously satisfy equations (1-46) and 
(1-47), they must also satisfy equation (1-48). Thus, every singular solution of 
equation (1-46) can be found by solving equation (1-48). However, the con- 
verse is by no means always true. Since the symbol p is often used to denote 
y' , equation (1-48) is sometimes called the p-discriminant equation and the 
curve described by this equation is sometimes called the singular locus. 
When attempting to find all solutions of a first-order equation, the solutions to 
the p-discriminant equation should be checked to see if they also satisfy the 
differential equation. 

For example, consider the first-order quadratic equation 
F(x, y, y') — Ay' 2 + By' + C = 0 

where A, B, and C are functions of x and y. Upon eliminating y' between this 
equation and the equation 


dF_ 

by' 


= 2Ay' + B = 0 


we find that the p-discriminant equation is 


B 2 — 4AC = 0 


Thus, for the first-order equation 


F(x, y, y') = xy' 2 — Syy' +9x 2 = 0 (1-49) 


the p-discriminant equation is 


y 2 = 4* 3 
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which has the two solutions y=± 2* 3/2 . Direct substitution shows that both 
these solutions satisfy equation (1-49); and, therefore, they are both singular 
solutions. 

1.6 LINEAR EQUATIONS 

In this section we present a number of important special properties which 
are possessed by the solutions of linear equations. The general form of the 
nth-order linear equation is given in equation (1-7). If fi(x) and fz(x) are 
any two solutions to a linear homogeneous differential equation and C\ and 
C 2 are any constants, the function 

y=cifi(x) + c/ 2 U) (1-50) 

is also a solution. This linear superposition principle is extremely important 
in analysis. It can easily be extended to include linear combinations of any 
number of solutions. Notice that the homogeneous equation always possesses 
the trivial solution y = 0. 

The equation obtained from a particular linear equation of the form (1-7) 
by setting b(x) equal to zero is called the associated homogeneous equation 
of this equation. Any solution to the associated homogeneous equation is 
called a homogeneous solution of the equation. If f(x) is any solution of equation 
(1-7) and if f H (x) is any homogeneous solution of this equation, the function 

f(x) + cf H (x) 

is also a solution. Of course, the superposition principle holds only for the 
homogeneous solutions of a linear equation. 

Notice that the linear equation (1-7) can always be written in the normal 
form (1-19) with 


C = _£i y (n-1)_ 

a 0 


dn-i 

do 


t a n , 

y y— b 

do 


(1-51) 


for all values of x where the coefficient ao(«) is not equal to zero. It follows 
that for these values of x 


dG _ On-j(x) 
dy® ao(x) 


for 7 = 0,1,2, . . .,n — 1 


(1-52) 
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Now suppose that the coefficients a<>, ai, . . a n , b of equation (1-7) are 
all continuous in some finite 22 interval 


a^x^/3 (1-53) 

Then if ao(x ) does not vanish at any point of this interval, it will always be 
possible to find a constant K such that 


_««-£(*)■ for/=l,2,3, . . .,n-l 

a 0 (x) J 

and for all x in the interval (1-53). It can also be seen from equations (1-51) 
and (1-52) that the partial derivatives (1-21) are continuous for all values of 
y\ • • and all values of x in this interval. Hence, in this case, G 

not only satisfies the conditions imposed in the fundamental theorem, but it 
satisfies the more restrictive conditions given immediately following it. 

We therefore conclude that for Unear equations the fundamental theorem 
can be stated in the following way: 

Let the coefficients ao(x), aj(x), . . ., a n (x), b(x) of equation (1-7) be con- 
tinuous on some interval a < x < /3 which contains the point xo, and suppose 
that the function ao(x) does not vanish in this interval. Then for any real numbers 
Yo, • • Y n there is a unique solution y = f(x) of equation (1-7) satisfying the 
initial conditions 


y(x<>) = Y 0 ,y'(x 0 ) = Y u . . ., y^ n ~^(x 0 ) = Y n . 1 

And this solution satisfies the differential equation on the entire interval 
a < x < {1. 

If the linear equation (1—7) has coefficients which are continuous in some 
interval, the points in that interval where the coefficient a«(x) vanishes are 
called singular points of the equation. We shall have more to say about such 
points subsequently. 

A set of functions gi(x), g^(x), . . ., g n (x) is said to be linearly inde- 
pendent in an interval if it is impossible to find constants ci, . . . , c n which 
are not all zeros such that the expression 


22 This means that neither a nor 0 is equal to infinity. 
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Clg\(x) + C2g 2 (x) + . . . + c„g„(x ) 

is equal to zero at all points (i.e., identically) of the interval. A set of functions 
which is not linearly independent is said to be linearly dependent. 

If the set of functions gi(*), . . gn(x) is linearly independent, no one 
of these functions can be expressed as a linear combination of the others. 
Hence, if a,- ¥= 0, it will not be possible to express the function 

f{x) = aigi(x)-\-a 2 g 2 {x)+ . . . + a„g n (x) (1-54) 


in the form 

f(x)=eigi(x) + e 2 g 2 (x)+ . . . +ei-igi-i(x) + e i+ igi +1 (x)+ . . . + e„g n (x) 

in which the function gi( x) no longer appears. Thus the constants appearing 
in equation (1-54) are independent. 23 

For example, the functions gt(x)=2x — 5 and g 2 (x) =6x — 15 are not 
linearly independent since 


Clgl (x)+C2g 2 (x)—0 

when ci = 3 and c 2 =— 1. However, the functions 

gi(x) = 2x — 5 and g 2 {x) = 2x-\-S 
are linearly independent. 

Let gi, g 2 , . . ft be a set of (n — 1) -times continuously differentiable 
functions. The determinant 


g l g2 • • • gn 

gl g 2 • • • • gn 


W{gl,g 2 , . . -,gn) = 


. . . sir 1 * 


23 This means that eq. (1-54) is not expressible in terms of fewer than n constants. 
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is called the Wronskian of these functions. Now suppose that the functions 
gu g 2 , . . g n are linearly dependent on some interval. Then there exist 
constants Ci , C 2 , . . . , c n not all zero such that 

C\g\{x) +C2g2(x) + . . . +Cngn(x)= 0 

at every point of the interval. Hence, we can differentiate this expression 
n — 1 times to obtain 

C\g[(x) + Czg' 2 {x) +• • •+ C n g' n (x ) =0 


+ Czff?~ l) {x) + . . . + ^(x) — 0 

This gives us a set of n linear equations for the n nonzero constants 
Ci, C 2 , . . ., c n . If these equations are to be solvable for the constants at all 
points of the interval, the Wronskian must vanish at all points of this interval. 
Hence, we can conclude that if the set gi, gn of (n — 1 )-times continu- 

ously differentiable functions is linearly dependent on the interval a =£ x =£ /3, 
the Wronskian of these functions must vanish at all points of this interval. 

Another way of saying this is that if the Wronskian of a set gi, &, . . ., g„ 
of (n — 1 continuously differentiable functions does not vanish at every 

point of the interval a =£ x *£ /3, these functions are linearly independent on this 
interval. 

Thus, the Wronskian of the functions (discussed in the previous example) 
gi (x) = 2x — 5 and gz (x) = 6x — 15 is 

2x — 5 6x — 15 

W(gu gz) = 

2 6 

= (2x — 5)6 — 2(6x — 15) =0 

And this shows that these functions are linearly dependent. However, the 
Wronskian of the linearly independent functions gi(x) = 2x — 5 and gz{x) = 
2x+ 5 is 
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2x — 5 2x + 5 

W(g u g*) = =-20 

2 2 

which is certainly not equal to zero. 

Now consider the associated homogeneous equation of equation (1-7) 
aoy (n) + a\'f n ~ l) + . . . + d n -\y' + dnj = 0 (1-55) 


and suppose that the coefficients ao, a \ , . . . , a n are continuous in the interval 


a =£ x =£ /3 (1-56) 


and that ao(x ) does not vanish on this interval. We shall now show that any n 
solutions fi(x), . . ., f n (x) of equation (1-55) are linearly dependent on the 
interval (1-56) if, and only if, their Wronskian W(fi, it, . . ., f n ) vanishes at 
every point of this interval. 

Another way of saying this is that the n solutions fi(x), . . ., f n (x) of 
equation (1-55) are linearly independent on the interval (1-56) if, and only 
if, the Wronskian W(fj, . . ., f n ) is not equal to zero at some point of this 
interval. 

We have already shown that any set of (n — l)-times continuously differ- 
entiable functions is linearly dependent on an interval only if their Wronskian 
vanishes at every point of this interval. Hence, it is certainly true for a set of 
n solutions. 24 In order to show that, conversely, the vanishing of the Wronskian 
implies that the solutions are linearly dependent, suppose that W(fufz, . . ., 
f n ) vanishes at every point of the interval (1-56) and let be any point of this 
interval. Then the Wronskian certainly vanishes at this point, and therefore 
the system of n equations 


24 Recall that the definition of a solution requires that it be n — 1 times continuously differentiable. 
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C\fi(x 0 ) + Czfo(xo) + . . . + C n f n (xo)=0 

cJ\(xo) + Czf 2 (xo) + ... + c„f n (x o) = 0 

(1-57) 


Ci/^-^Uo) + c 2 /il , - 1) (*o) + . . . +c„/ < "- 1) (*o)-0' 

has a solution Ci = Ci, c 2 =c 2 , . . ., c n =c n such that the constants Ci, c 2 , . . 
c n are not all zeros. Hence, we can define a function f(x) by 

f(x)=Sxfi(x)+c z f 2 (x)+ . . . +?*fn(x) (1-58) 

Then it follows^ from the linear superposition principle that f(x) is a solution to 
equation (1-55) in the interval (1-56). And differentiating equation (1-58) 
n — 1 times shows that 

f =Cxf I + c/ 2 -t- . . . + Ctifn 

= + crfi + . . . + Crfn 

• • • « 

• • • • 

• ■ • • 

/ (n ' 1) = e,/ ( i n_1) — c 2 /(j n_1) + . . . +6„/<r 1) 

Hence, upon setting x = xo in this system and using equation (1-57), we find 
that the solution / satisfies the n initial conditions 

f(xo)=f'(x 0 )= . . . =f {n ~ 1) (x o)=0 

Since the trivial solution of the homogeneous equation (1-55) also satisfies 
these conditions and the fundamental theorem shows that there is only one 
such solution, we therefore conclude that f(x) is equal to zero at every point 
of the interval ( 1-56) . Therefore, 

Clfl(x) +e 2 / 2 (*) + . . . +C n f n (x) = 0 
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at every point of the interval (1-56). But since the constants Ci, C 2 , . . 
c n are not all zero, this implies that the set /i,/ 2 , . . .,/« is linearly dependent 
and therefore proves the assertion. 

We shall now show that the linear homogeneous equation (1-55) always 
has n linearly independent solutions on the interval (1-56). To this end notice 
that for any point xo of this interval the fundamental theorem shows that equa- 
tion (1-55) has n solutions f\(x) , f-z(x) , . . f n (x) which satisfy the initial 
conditions 


fi(xo) — l 

//(*<>) =/r(*o)= . . . =A n -»(xo)=o 


and for r— 2, 3, . . ., n 

M* 0) =//(*«)= • • • =/? r - 2) (*o) = o 

fr-D( Xo ) = l 

fi r Kxo)=F/ +1 Kxo)= • • o)=0 


It is easy to see from these conditions that the Wronskian of these n solutions 
has the value unity at the point * 0 . Thus, it is not zero at every point of the 
interval a x ^ /3, and we can therefore conclude that these n solutions are 
linearly independent. 

For example, the second-order differential equation 


&2L 

dx 2 


y=0 


has the two solutions y— sin x and y = cos x. 
The Wronskian of these solutions is 


>F(sin x, cos jc) = 


sin x cos x 


cos x sin x 


= — sin 2 x — cos 2 x=— 1 #0 


30 


INTRODUCTION 


Hence, the solutions are linearly independent. 

If iu U, • • f n are any n linearly independent solutions of equation 
(1-55) on the interval (1-56), and if h(x) is any other solution of equation 
(1-55) on this interval, there exist constants Ci, c 2 , . . c„ such that 

h(x) = C\f\(x) + C%f 2 (x) + . . . + Cn/n(*) (l - 59) 

on the entire interval. 

This means that every solution to the equation on this interveil can be 
obtained from the general solution 

f(x,c u c 2 , . . .,c n ) =C\f\(x) +C 2 / 2 OO+ • • . +C n fn(x) 

In order to prove this assertion, notice that since f\, . . . , /„ are linearly 
independent, W (f\, f 2 , . . ., f n ) cannot be zero at every point of the interval 
(1-56). Hence, there exists a point, say x 0 , for which the Wronskian is not zero. 
The n equations 

h(x 0 ) = C\fi(x<>) + C^f 2 (x o)+ . . . +Cifn(x o) \ 

h (xo) <— C\fi (jfo) 0 ) • • • + Crf n (Xo) / 

> ( 1 - 60 ) 

• • • . I 

• • • 1 

A(»-‘)Uo)=c 1 yi”-»Uo) + ^ n - 1) Uo)+ . . . +c/t»-»)Uo) J 

can therefore be solved for the n constants Ci, C 2 , . . . , c n . Hence, the linear 
superposition principle shows that the function g(x) defined by 

g(x) = C x f l (x)+Csf 2 (x)+ . . . +Crfn(x) (1-61) 

is a solution to equation (1-55) on the interval (1-56). And it follows from 
equation (1-60) that 

h(xo) =g(x 0 ), h' (xo) - g' (xo) , . . ., A (n - 1) Uo)=gr ( ” _1) Uo) 

But since the fundamental theorem shows that there is only one solution 
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satisfying a set of n initial conditions, we can conclude that g(x) = /i(:x;) for 
all x on the interval (1—56). Equation (1—61) now shows that equation (1-59) 
holds, and this proves the assertion. 

It is easy to see from these results that equation (1-56) cannot possess 
n+ 1 linearly independent solutions on the interval (1-56). 

Now consider the general linear equation (1-7) and suppose that the 
coefficients ao(*), a.\{x), . . ., a n (x), b{x) are continuous on the interval 

a=Sx=£/3 (1-62) 

and that ao(x) does not vanish at any point of this interval. Let fo(x) be any 
particular solution of equation (1-7); and let /i, / 2 , . . . ,/ n be any n linearly 
independent solutions of the associated homogeneous equation. Then if h(x) 
satisfies equation (1-7) on this interval we can find n constants Ci, . . ., c n 
such that 

h(x) = cfx(x)+ . . . +Crfn(x)+fo(x) (1-63) 

on the entire interval. 

This means that every solution on the interval can be obtained from the 
general solution 


f(x,Ci,C 2 , . . ., C„)=C ] f l (x) + C 2 f 2 (x) + . . . + Crfn(x) + fo(x) ( 1 - 64 ) 

In order to prove this assertion, notice that h(x) —fo(x) is a solution to the as- 
sociated homogeneous equation of equation (1-7). Hence, we can find constants 
ci,C 2 , . • . ,Cn such that 

h(x)— Mx) = cfx(x)+c z f 2 (x)+ . . . +Ctfn(x) 

This shows that equation (1-64) holds and proves the assertion. 

The general solution of the associated homogeneous equation 

c\fi(x)+c 2 f 2 (x)+ . . . +c n f n (x) 

which appears in the general solution (1-64) of the nonhomogeneous equation 
(1—7) is called the complementary function. And any set of n linearly inde- 
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pendent homogeneous solutions is known as a set of fundamental solutions 
to the homogeneous equation. 

For example, the complementary function of the second-order equation 


d 2 y 

dx 2 


y=x 


(1-65) 


is Ci cos x + c 2 sin x where Ci and c 2 are arbitrary constants. A particular 
solution to this equation is y=x. Then 


y= Ci cos x + c 2 sin x + x 

is a general solution of equation (1-65), and any other particular solution of 
this equation can be obtained by assigning particular numerical values to the 
constants Ci and c 2 . 
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CHAPTER 2 


First-Order Equations 


In this chapter a number of elementary techniques for obtaining explicit 
solutions for certain types of first-order equations are developed. These 
equations are chosen for consideration because they are simple enough to 
be solved explicitly and, at the same time, occur frequently enough in practice 
so that they are worth solving. As it happens, these equations are all of the 
first degree. 

2.1 SYMMETRIC NOTATION FOR EQUATIONS OF FIRST DEGREE 

The first-order differential equation of the first degree is an equation of the 
form 


y'N{x,y) +M(x,y) — 0 (2-1) 


We shall suppose that M and N possess continuous first partial derivatives in 
some domain D of the x, y plane. If N(x, y) were nonzero at every point of 
D , we could write equation (2-1) in the normal form 

✓ ( 2 - 2 ) 

and the fundamental theorem would guarantee that this equation have a 
unique solution passing through each point of D. Points of D where N(x, y) = 0 
are called singular points of the equation. 

In equation (2-1), y is considered as the dependent variable. The solutions 
of this equation are functions of the form 
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y=f(x) 


(2-3) 


On the other hand, the equation 


^M(x,y)+N(x,y) = 0 

ay 


(2-4) 


which is closely related to equation (2 — 1) possesses solutions which are func- 
tions of the form 


x=g(y) 


(2-5) 


The singular points of this equation occur when M(x, y) = 0. 

However, in any domain in which neither equation (2-1) nor (2-4) has a 
singular point, any solution (2-3) of equation (2-1) can be solved for x to obtain 
a solution of the form (2—5) of equation (2—4), and conversely. Thus, for many 
purposes, it is not necessary to distinguish between equations (2-1) and (2-2). 
When this is the case, we can introduce the symmetrical notation 


M(x,y)dx + N(x,y)dy=0 (2-6) 


If M{x, y) and N(x, y) are both not zero, we understand this notation to 
denote either equation (2-1) or equation (2-^4). When M(x,y) =0 but N(x,y) 
¥> 0, the existence of the solution (2-3) of equation (2-1) is guaranteed by the 
fundamental theorem, but that of equation (2-4) is not. 25 Hence, in the neigh- 
borhood of a point where M(x,y)= 0 we understand the notation (2-6) to 
denote equation (2-1). Similarly, when N(x, y) =0 but M(x, y) ^ 0, we under- 
stand the notation (2-6) to denote equation (2-4). The fundamental theorem 
does not guarantee that either equation will possess solutions at points 
where M(x,y) and N(x,y ) are both zero, and we therefore say that such 
points are singular points of equation (2-6) . 


2S At such points, dxldy = and dy/dx = Q; hence, dx/dy is not continuous. 
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2.2 EXACT EQUATIONS 

The first-order differential equation of the first degree 

M(x,y)dx + N(x,y)dy=0 (2-7) 

with M and N defined and continuous in some domain D, is said to be exact 
in D if the line integral 

^ [M(x, y)dx + N(x, y)dy] 

has the same value for all paths of integration which lie in D and which have the 
same end points. This is equivalent to requiring that the integral vanish around 
every closed path within D. 

It is shown in books on calculus that the differential equation (2-7) is 
exact if, and only if, there exists a continuously differentiable function <f>(x, y) 
such that 26 


M = ^ and N=^- (2-8) 

ox dy 

Then the function <£, which is defined only to within an additive constant, 
is given by 


f(x,y) 

<j>(x,y) = <t (y u+Ndy) + constant 

J(xo,yo) 


(2-9) 


where (*o, Jo) is any fixed point and the integral is carried out over any curve 
T within D which joins the fixed point ( xo , yo) with the variable point (x, y). 

Suppose that equation (2-7) is exact. Then it follows from equation (2-8) 
that the total derivative of <f> with respect to x along any direction is 

Similarly, the total derivative with respect to y is 


^Alternatively, we can say that M dx + N dy is an exact differential of the function <f> or that d<f> — M dx + N dy. 
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d$ ... d<£ . d<M* _ , r . dx 
dy dy dxdy dy 

Since the differential equation (2-7) denotes either the equation 


(2-11) 


M+y'N= 0 


or the equation 


N + -f-M=0 
dy 

equations (2-10) and (2-11) show that (f> is constant along every solution 
curve of equation (2-7). Hence, $ is an integral of equation (2-7). However, 
for a first-order equation, finding its integral is equivalent to finding its solu- 
tion in implicit form. Thus, an exact equation can effectively be solved simply 
by carrying out the integral (2-9) along any convenient path. 

It is easy to see by differentiating equations (2-8) and interchanging the 
order of differentiation that, if M and N are continuously differentiable, a 
necessary condition for equation (2-7) to be exact is that 


dM = dN 
dy dx 


( 2 - 12 ) 


Now suppose that the domain D has no holes. That is, it is similar to the 
domain shown in figure 2-1 (a) but not the one shown in figure 2-1 (b). Such a 
domain is said to be simply connected. 

Let T be any closed curve within D and let R be the region enclosed by 
T, as shown schematically in figure 2-2. Then Green’s theorem 

I [Mdx + Ndy]= If )dxdy 

R 


shows that condition (2-12) is also a sufficient condition for exactness. If D 
were not simply connected, condition (2-12) would not be sufficient to ensure 
that the line integral will always vanish along a closed path such as T in figure 
2 - 2 . 
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(a) (b) 


(a) Domain has no holes, (b) Domain has a single hole. 
Figure 2-1.— Singly and multiply connected domains. 



FIGURE 2—2. — Path of integration within D. 

We shall always suppose that the conditions of differentiability and con- 
nectedness given in the preceding paragraphs are satisfied and therefore that 
equation (2-12) is the necessary and sufficient condition for equation (2-7) 
to be exact. 

For example, consider the equation 


2y±£ 

x 2 + y 2 * 2 + y 2 


dy= 0 
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Then 


and 



Hence, equation (2-12) holds and the differential equation is exact. Therefore, 
carrying out the integral (2-9) along the path depicted in figure 2-3 shows that 


2y+sp 


ft — 2 dy+ 

co + y 


2s — y 
s 2 + y 2 


= ln(x 2 + y 2 ) — tan -1 — + constant 

y 


is an integral of the equation. 

Unfortunately, the majority of first-degree equations encountered in 
practice are not exact. It is therefore natural to ask whether there exists a 
function \(s, y) such that the equation 


dx + N dy) = 0 


(2-13) 
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(obtained by multiplying both sides of eq. (2-7) by X) is exact. It can be shown 
(ref. 3, p. 27) that such a function always exists, provided only that equation 
(2-7) possesses exactly one general solution. It is called an integrating factor. 
Integrating factors are certainly not unique for, if X is an integrating factor of 
a given first-order equation, so is cX for any constant c. 

Now if X is an integrating factor for equation (2-7), the condition (2-12) 
for exactness must hold for equation (2—13). Hence, 

d(kM) ^ d(kN) 
dy dx 


or 


„,dX » r dX , v ( dM dN \ n , ms 

M aj- N te +X { W~^) ° (2_14) 

Thus, X is the solution of a linear partial differential equation (see section 3.3) 
which is usually more difficult to solve than the original ordinary differential 
equation. However, since it can be shown that every solution of equation 
(2—14) is an integrating factor of equation (2-7), it is only necessary to find a 
single particular solution to equation (2—14). 

Although, in general, finding an integrating factor is quite difficult, it is 
sometimes possible to accomplish this simply by inspection. For example, 
upon multiplying the equation 

L/C*) + y]dx — x dy = 0 

by 1/x 2 we obtain the equation 

[/(* ) + y ] dx - i dy — 0 

And since d/dy\_(f+y)lx 2 ] = 1/x 2 and d/dx(— 1/x) = 1/x 2 , this equation is 
exact. Then carrying out the line integral (2-9) shows that the differential 
equation possesses the integral 



fix) 

X 2 


dx 
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2.3 EQUATIONS OF THE TYPE g(y)y' =/(*) + h(x)G J /(x)dx-J g(y)dyj 

Many of the differential equations which can be solved by classical 
methods are special cases of the differential equation 


~g(y)dy+[f(x)+h(x)G ( J f(x)dx-J g(y)dyj 


dx = 0 (2-15) 


where g(y) ,f(x) ,h(x), and G can be any functions of their arguments. In order 
to obtain a solution to this equation put 


U = J f(x)dx — J g(y)dy 

Then dU= f(x)dx—g(y)dy and equation (2-15) can be written as 


dU+h(x)G(U)dx = 0 


It is easy to see that 1 IG{U) is an integrating factor for this differential equation 
and therefore that the equation 


-^jjj-\-h(x)dx = 0 

is exact. Hence, it follows from equation (2-9) that 

rj f(x)dx- S g(y)dy j f 

</>=! G(UjdU~*~J h ( x )dx = constant 


(2-16) 


is an integral of equation (2-15). 

We now show that many of the first-order normal equations, which are 
solvable by the classical methods, can be obtained by specializing the functions 
/, g , h, and G in equation (2-15) and, therefore, that the solutions of these 
equations (actually the integrals) are all given by equation (2-16). 


42 


FIRST-ORDER EQUATIONS 


First put G(U) = 1, and A (*)=(). Then equation (2—15) becomes the 
general separable equation 


—g(y)dy+f(x)dx = 0 

and equation (2-16) shows that its integral is simply 

</>= J /(x)dx- J g(y)dy 

Now put G(U) = (al(3)e ula H(e~ u ) — 1, g(y)=oc/y, f(x) — h(x)=^/x. 
Then equation (2-15) becomes 

dy=x^-m(^jdx (2-17) 

which is the isobaric (or one-dimensional ) equation. In this case, the integral 
(2-16) becomes 


/ 


in(afiltg) dU 


a 


— In x& = constant 


^ e ula H(e~ u ) — 1 

r 


or 


/ 


UPlxP) 


dV 


— In x& = constant 


a 


^ yt (j/)_ V 


If we put ot = (3= 1 in equation (2—17), we obtain the homogeneous equation 

dy=H(^jdx (2-18) 

And equation (2—16) shows that its solution is 

'vlx dV 


P 


H{V)-V 


— In x = constant 
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In order to recognize whether a given differential equation has the form 
(2-17), it is necessary to be able to determine whether any given function 
f) ( x , y) can be expressed in the form 

(£) (2-19) 

A necessary and sufficient condition that 17 (x,y) can be expressed in this form 
is that there exist a number p such that 

7 1 * p y) = t p ~ 1 V (*> 7 ) ( 2 - 20 ) 


for all values of t. 

In order to show that a function which satisfies the condition (2—20) can 
always be expressed in the form (2-19), put t=\jx in equation (2-20) to get 

V =* ,_p i7 (*>r) 

Now choose a number a and define the number /8 and the function H (U) by 
/J = pa and H(U) =7)(1, U lla ), respectively. Then 

which shows that 17 (x, y) can be expressed in the form (2—19). 

For example, replace y by yt I/a and x by xt in the function 


V (*> y) = 


y(jc — y^a In y + y° In a) 
x 2 (a In y — In x) 


to show that 


r)(xt,yt lla ) = 6 x ~ a)la 7) (x,y) 

Hence, this function satisfies condition (2—20), and it can be put in the form 
(2- 19) by introducing the variable y°lx to obtain 
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y(x, y ) = xf 1/a) ~ 1 



In the important special case where a = (3 (corresponding to the homogeneous 
eq. (2-18)), condition (2-20) reduces to 

r)(tx,ty) = ri(x, l y) (2-21) 

A function of two variables which satisfies condition (2—21) is said to be homo- 
geneous of degree zero, which, of course, accounts for the name given to 
equation (2-18). More generally, a function of the n variables Xi, Xi, . . ., x n , 
say 7)(xu X2, . . ., x n ), is said to be homogeneous of degree k in the variables 
Xi, X 2 , . . ., x n if it satisfies the condition 


7}(txi, tx 2 , . . ., tXn ) = t k 7)(x 1, Xi, . . ., X n ) 


( 2 - 22 ) 


for all values of t. 

Another well-known special case of (2—15) is obtained by putting 
G(U) = e- (fc - 1){/ , /*(*) = »M*) exp ^(A: — 1) J /(*)cfocj, and g(y) = 1/y 


to obtain Bernoulli’s equation 

dy = [f{x)y + tft(x)y k ]dx 
It follows from equation (2-16) that its solution is 


(2-23) 


exp 1) J /(x)ch:j 


(k- l )^- 1 


+ 


j ifj(x) exp ^(A: — 1) J /(jc)cbcj dx = constant 

(2-24) 


When k = 0, equation (2—23) reduces to the first-order linear equation 


dy= [f(x)y+ tji(x)]dx (2-25) 
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And equation (2-24) shows that its solution is 

y=e Sf (x)dx * , fj(x)dx + constant e SfMdx 

2.4 RICCATI EQUATION 

The Riccati equation 

^ =/(*) + g(x)y + h(x)y 2 (2-26) 

can be thought of as a generalization of the linear first-order equation (2—25). 
This equation is sometimes called the generalized Riccati equation since 
Riccati actually studied the special case 

~j~ + by- — cx m 

ax 


where b and c are constants. 

If a particular solution y x of equation (2—26) is known, the general solu- 
tion y of this equation is 


, 1 

y = y + v 

where U is the general solution of the linear equation 

+ (g + 2y\h)U + h = 0 

which was solved in section 2.3. This follows from the relation 

f-/- 4y -V=f-/-«y,-Ay?-i(f + *£/+A + 2y^) 
= ~w\% + {g+2y ' h W +h \ 


= 0 
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Even if a particular solution is not known, it is still possible to reduce 
the problem of solving equation (2-26) to the problem of solving a linear equa- 
tion; but in this case the equation is of second order. In order to accomplish 
this put 


U = e ~ f yh(x)dx 


Then U' / U — — yh and 

U" = - yhlT ~ y'hU - yh'U = (y^h 2 - y'h)U - yh'U 

= — (f + gy)hU — yh'U 

= -fhU-y(gh + h'W 



Hence, U satisfies the second-order linear homogeneous differential equation 

U"~(g+J ) U'+JhU= 0 

The converse of this statement is also true. That is, every second-order linear 
homogeneous equation can be transformed into a Riccati equation. 

Since linear second-order equations will be discussed extensively in 
chapter 6, we shall not consider the Riccati equation further here. 
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Systems of Equations 


In this chapter, a number of important properties of systems of differential 
equations are discussed. We shall also show how the solutions of these systems 
can be used to obtain solutions to certain types of first-order partial differential 
equations. 

3 . 1 . FIRST-ORDER NORMAL SYSTEMS 

In certain applications it is necessary to deal with systems of simultaneous 
differential equations. The set of n first-order normal differential equations in 
the n dependent variables yi, y 2 , . . ., y n 


dy i 
dx 

dy 2 

dx 


dyn 

dx 


= Gi(*, y u y 2 , . . ., y„) 
= G 2 (x, y u y2, . . yn) 


— Gn(x, yu y2* • • •» yn) 


( 3 - 1 ) 


is called a normal system. It provides a standard form to which all normal 
differential equations and all systems of normal differential equations can be 
reduced. For example, by introducing the new dependent variables yi defined 
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by 


d}y 

yi+1 = 5? 


for i = 1, 2, 


n — 1 


the nth-order normal equation (1 — 19) can be transformed into a first-order 
normal system 


dy t_ 
dx 


y-2 



dyn-i 

dx 


yn 


-j£ = G(x, y u y 2 , . . y„) 

However, it is by no means always possible to transform a first-order normal 
system into a single nth-order equation. 

In order to obtain a geometric interpretation of the system (3-1) and its 
solutions, it is again convenient to think of the variables x, yi, y 2 , . . ., y n as 
being the coordinates of an (n + l)-dimensional space. The definitions of a 
domain and of a neighborhood in this space were given in section 1.1. Based on 
these ideas, the fundamental theorem for the first-order normal system can be 
stated as follows: 

Suppose that the functions Gj/or i= 1,2, . . . , n are defined and continuous 
in some domain D and that the partial derivatives 


dGt 

dyj 


for 1 =£ i *£ n; 1 =£ ) s£ n 


are continuous in D. Then for each point (xo, y?, y”, • • ., y^) of D equation 
(3~1) has precisely one solution 
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y i = /i(«) 

y 2 =f 2 (x) 

. ' (3-2) 

Jn = fn(x) 

which is defined on some interval a < x < containing the point xo and satisfies 
the initial conditions 

fi(x 0 ) = y° 1 , Mxo)=yl, . . . ,fn(x 0 )=y % (3-3) 

The solutions of the system (3-1) can be visualized as curves in the 
(n + l)-dimensional space. Then the fundamental theorem states that there is 
precisely one such curve passing through each point of the domain D in this 
space. 

If H(x, yi, y 2 , . . ., y„ ) is a nonconstant function such that every solution 
to the system (3-1) satisfies an equation of the form 27 

H(x,yi,y 2 , . . ., y„) = constant (3-4) 

(the constant may be different for different solutions), then H(x, yi, y 2 . . ., y n ) 
is called an integral (or first integral) of the system (3-1). Thus, an integral of 
the system (3—1) is a function H which is not identically constant but is constant 
along each solution curve of the system. Therefore, upon differentiating H 
along any solution curve of the system (3—1) we obtain 


dH_ 

dH . 

dH_dy i 

dH dy n 

dx 

dx + 

dyi dx 

dy n dx 


dH 

dH 

dH ^ „ 


dx + 

r— Gi+ . . 
dyi 

■ + ^ G ” =0 


27 Notice that this definition of an integral is consistent with the one given in chapter 1 for the nth-order normal 
equation when the latter equation is transformed, by the procedure just described, into a first-order normal system. 
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provided that H possesses continuous first partial derivatives. But each point 
of D lies on a solution curve of the system (3-1). Hence, H satisfies the equation 


dH dH r dH r 

~Z r -r Lr 1 ■+■ T tr2 t . 

dx oyi oy 2 


. + |^G n = 0 

dy n 


(3-5) 


at every point of D. Conversely, if H is any solution to equation (3-5) in D, it 
is certainly constant along every solution curve of the system (3-1). Hence, H 
must be an integral of this system. We have now shown that a nonconstant 
function H with continuous partial derivatives is an integral of the system 
(3-1) in D if, and only if, it satisfies the first-order linear partial differential 
equation (3—5). 

Each integral H of the system (3-1) determines a family of n-dimensional 
hypersurfaces in the region D, and every solution curve lies on one of these 
hypersurfaces. Now suppose that we have found n integrals, say Hi, H 2 , 
. . ., H n , of the system (3-1) which are defined in the region D. If the Jacobian 
determinant 


dHi dJU 

dyi dy n 


d (// 1, . . ., Hn) — 
d{yi, . . ., y n ) ~ 


dHn dHn 

dyi dy„ 


(3-6) 


of these integrals is different from zero in D, it will be possible 28 to solve the n 
equations 


H\{x,yi,. . . , y«) Ci,. . ,,Hn(x,yi,. . .,yn) Cn (3 7) 

for yi , . . ., y n as functions of x. And these functions will depend on the n 
arbitrary constants Cj, . . . , c n . The n equations (3—7) therefore implicitly 
determine an n-parameter family of curves in the ( n+ 1) -dimensional space. 


28 This is a consequence of the implicit function theorem for systems of equations. See, for example, ref. 6. 
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Figure 3—1. — Relation between integral curves and surfaces of ordinary differential equations. 


But these curves must lie along the various mutual intersections of n hyper- 
surfaces 


Hi = constant, H 2 = constant, . . ., H n = constant 

This is shown schematically in figure 3-1 for the case where n = 2. However, 
since every solution curve of the system (3—1) must simultaneously lie on n 
hypersurfaces, each of which corresponds to one of the n integrals Hi, . . 
H n , the solution curves must also lie along these intersections. Therefore, 
every solution to the system (3-1) can be obtained as an implicit solution of the 
n equations (3-7). That is, the n integrals Hi, H n which have a non- 
vanishing Jacobian in D are just sufficient to completely determine all the solu- 
tions of the systems (3-1) in D. However, the requirement that the Jacobian 
(3-6) be different from zero in D implies that the integrals are functionally 
independent 29 in D. We therefore anticipate that the system (3-1) will possess 
n functionally independent integrals. This turns out to be the case, at least 
when the region D is chosen to be sufficiently small (see ref. 7). 


29 The functions Hu • • H n are functionally dependent in D if there exists a nonconstant function at of Hu • • •» 
H n such that <o(Hu ...» Hn) =0 for all x, y u . . y* in D . See ref. 6. 
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For example, consider the normal system 


dyi _ 
dx 


Gi{x,y u y 2 ) 


xy i 

y?-y| 


dy> 

dx 


= G 2 (x,y u y 2 ) 


xy 2 

yf-y? 


which is defined in some region D which does not include the planes y\ = ±y 2 . 
Since the two functions H\{x , yi, y 2 )=yiy 2 and // 2 = yf + y| — x 2 satisfy the 
equations 


d//i , dH x y-, , d//i 

T : t“ Cri “1 7 

dx dyi dy 2 


C 2 — 0 “I - y 2 



= 0 


and 


dx dyi 


G i + 


a// 2 

3y2 


C 2 = — 2x + 2yi 



we know that ffi and // 2 are two integrals oi this system. Since 


d(yi,y 2 ) 


72 

2yi 


y i 

2y 2 


= 2(yJ-rf) 


the Jacobian of these functions does not vanish in D. And this shows that Hi 
and H 2 are functionally independent. The solution curves lie on the inter- 
sections of the surfaces H i = constant with the surfaces H 2 = constant. Thus, 
every solution curve lies on an intersection of a hyperbolic cylinder and a 
hyperboloid (or cone). 

Now let Hi, H 2 , . . ., H n +i be any n + 1 integrals of the system (3-1) in 
some domain D. Since each of these integrals must satisfy equation (3-5), it 
follows that 
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8Hi djh 

dx dyi 


G\ + 


8H 2 , dH 2 n , 

+ T Gi + 
ox dy i 


dHn+i 

dx 


dHn + 1 
dyi 


Gi + 


+ ^G n = 0 


+ 


dy n 

djh 

dyn 


Gn — 0 


dHji+i 

dyn 


Gn = 0 


And it must be possible to solve these equations for 1, Gi, G 2 , . . . , G n at 
every point of D. However, this can happen only if the determinant of the 
coefficients 




dHi 

dHi 

dHi 



dx 

dy i 

dyn 



dH 2 

dH 2 

dH 2 

d(H u H 2 , . 

. -,H n+ 1)_ 

dx 

dy i 

dy n 

d(x,y u . 

• -,y«) 

• 

• 

• 



dHn+l 

dx 

dH n + 1 
dy i 

dHn+l 

dyn 


is zero at every point of D. But this means that Hi, . . H n + 1 are functionally 
dependent. Hence, every set of n + 1 integrals of the system (3-1) is function- 
ally dependent. We have, therefore, shown that the system (3-1) possesses 
precisely n functionally independent integrals (at least in a sufficiently small 
region D). 

If n functions Ii(x, yi, . . . , y n ) , • • • , In(x, yi, . . ., y n ) not all identi- 
cally zero can be found such that 
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is equal to dFjdx for some nonconstant function F(x, y u . . y n ) , then F 
will be constant along every solution curve of the system (3-1). Hence, F 
will be an integral of the system (3-1). The functions A, . . . , I n can be 
thought of as integrating factors. 30 Although it is usually difficult to find inte- 
grating factors, they can sometimes be guessed at from the symmetry of the 
problem. 

For example, consider the system 


dyi __ 


dx 


= y 2 — y 3 


dy 2 

— — = Vo — 


dx 


ya-yi 



yi— y 2 


(3-S) 


Then choosing A = h — h — 1 gives 


IX 


dy t 

dx 


(y 2 -y,)] + lx[^ 



+ 1X 


dy 3 

dx 



= dyi + dy2 
dx dx 


dy 3 _ d 
dx dx 


(yi + y 2 +y 3 ) 


Hence, 7/i = yi + y 2 + y 3 is an integral of the system (3-8). Now choose A = 
2yi, 1 2 = 2 y 2 , and 7 3 =2y 3 . Then 


2y '[* “ (y!_y »)] +2y {& _ (y»-y.)]+2y>[^- (y. -»)] 

= 2y 'f +2 *S +2 *f=l« + ^> 

Hence, 77 2 == yi+yf + y 3 is also an integral of the system (3-8). It is easy to 
verify that TA and 7/ 2 are functionally independent. 


30 See section 2.2. 
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Instead of interpreting the solution (3-2) of the normal system (3-1) 
as the explicit equation of a curve in an (n+ 1) -dimensional space with 
coordinates x, y it . . ., y„, we can interpret it as the parametric equation 
(with parametric variable x) of a curve in an n-dimensional space with coordi- 
nates y x , . . y n . The parametric variable x may be the arc length or a similar 
parameter for the solution curves (such as the time in a physical problem). 
The fundamental theorem can still be used to show that there is a solution curve 
passing through every point of any region of this n-dimensional space, provided 
the functions Gi satisfy the appropriate smoothness requirements. However, 
except in a certain important special case (which is the topic of the next sec- 
tion), it will usually not be true that only a single curve passes through each 
point of this region. 

The simplicity of the first-order normal systems becomes apparent when 
'equation (3-1) is written in vector notation. To this end we notice that the 
coordinates of a point yi, . . ., y„ in the n-dimensional space can be interpreted 
as the components of an n-dimensional vector y and we write 31 

y=(yi,y2, . . .,y«) 

A vector is said to be a function of a single real variable x if its components 
are functions of x. Thus, if (as in the case of the solution curve (3-2) of the 
system (3-1)), 

yi=/i(x), . . • ,yn=Mx ) 

we write 


y = f(x) -(/*(*), • . .,/„(*)) 


31 Addition of two vectors and multiplication of a vector by a scalar are defined component-wise. Thus, if y— 
(yi, . . y„) and x= (*i, . . *„) are vectors and a is a number, y+ x= ( (yi + *i), . . (y« + *») ) , ay = (ay u • . 

ay H ). The length or magnitude of the vector y is denoted by |y| and is defined as the distance from the origin to the 
point yu . . y». Thus, |y| = (y^ + . . . + y£) 1/2 * Notice that the magnitude of a vector is zero if, and only if, its 

components are all zeros. The dot product or inner product of two vectors y and x is denoted by y * x and is defined 
by y*x = yixi+ . . . +y»x«. The dot product satisfies the Schwarz inequality |y • x| ^ |y| |x|, where the vertical 
lines on the left are the absolute value of a pure number and those on the right denote the magnitude of vectors. Thus, 
if |y| ¥■ 0 and |x| # 0 then |y ■ x|/|y| |x| 1 and this quantity can be interpreted as the cosine of the angle between 

the vectors y and x. If y • x = 0, we say that y and x are orthogonal; and if |y ■ x|/|y| |x| = 1, we say that the vectors 
are in the same direction. The vector (1, 0, 0, . . 0) has unit magnitude and is in the direction of the y r coordinate 

axis. The vector (0, 1, 0, 0, . . 0) has unit magnitude and is in the direction of the y 2 -coordinate axis, etc. We denote 

these vectors by ki, k 2 , etc. For more information, see ref. 8. 


I 
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Differentiation of vectors is defined component-wise, and we write 

dy _ / dyi dy n \ = df(x) _ ( dfi{x) df n (x) \ 

dx \dx' ‘ dx } dx \ dx ’ ' ' ' ’ dx ) 

If f(yi, . . . , y n ) is a function of the n variables yi, . . . , y n and y = 
(yi, . . y n ) denotes the vector corresponding to these variables, we write 


f(yu ■ • -,yn)=f( y) 


and say that / is a function of the vector variable y. More generally, we say that 
a vector is a function of a vector variable if its components are. Thus, if yi = 
/i(x), . . y„=/„(x), then 


y = f(x)= C/i(x), . . .,/n(x)) 

is a vector function of the vector variable x. 

Notice that there is no such thing as the derivative of a function of a vector 
variable with respect to that variable, but only partial derivatives with respect 
to the components of the vector variable. Thus, if ki, . . . , k n denote the unit 
vectors in the directions of the coordinate axes, we define the vector gradient 
operator V by 


V/(x) = k 1 


d/(x) 

dxi 


. . . + k„ 


d/(x) 

dXn 


Just as in the three-dimensional case, the effect of operating with the gradient 
operator on a scalar produces a vector. 

A vector function is said to be continuous if its components are continu- 
ous. Thus, in the system (3-1), each Gi is a function of a real variable x and a 
vector variable y and we write 


Gi(x, yi, . . ., y„) =Gi(x, y) 


If we let G be the vector whose components are Gi, then G is a vector function 
of the real variable x and the vector variable y and 
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G(*, y) = (G^x, y), . . .,G n (x, y)) 


Hence, upon using the component-wise definition of differentiation of vec- 
tors, the first-order normal system (3-1) can be written in the concise form 


dx 

dx 


= G(x, y) 


When the first-order system (3-1) is written in this form, the analogy between 
this system and a single first-order equation becomes particularly apparent. 


3.2 AUTONOMOUS SYSTEMS 


When the functions G, in the system (3-1) do not involve the variable x 
explicitly, the system is said to be autonomous and it can be written in the 
form 


i£= G ' (y " *• 

&=c 2 (y t ,n. 


^ = C„(y„y 2 , 



(3-9) 


If the independent variable x is thought of as representing time, autonomous 
systems can be interpreted as time-independent, or stationary, systems. 

The first-order normal system of n — 1 equation 


^=Gi (y,,y 2 , • • • ,y n -i,x ) 

~j“7 = G 2 (yi, y 2 , • • x ) 


dy n - 


^ i =G„-i(yi,y 2 , . . -,y»- ux)f 


(3-10) 
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can always be transformed into an autonomous system in n variables which has 
the property that at least one of the functions on the right side never vanishes. 
In order to do this, it is only necessary to introduce a new dependent variable 
Yn by 

yn = X 

and then rewrite the system (3-10) in the form 

^r=Gi(yi, y-2, . . y n ) 

^=G 2 (yu y-z, . . . ,y n ) 


=Gn-i(yi> y2, . • y«) 

dy » =1 

dx 

Conversely, if one of the G,-’s of the autonomous system (3-9) in n variables 
does not vanish in some domain Do, it can be transformed into a nonautonomous 
system in n-1 variables plus an additional equation which can be solved to 
determine the variable x once the remaining variables have been found. We 
may assume without loss of generality that the notation has been so chosen 
that G n does not vanish in Do. Now in order to accomplish this transformation 
put 


x = y n 


(3-11) 


and 


G~ — 


for 1 


(n-1) 


(3-12) 
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Then the system (3-9) becomes 


dy i 
dx 


=G 1 (y 1 , . . . ,y n -i,x ) 


<*y 2 ^ , 


y«-i, *) 


dy n -i 

dx 


G n -i(yi , 


y«-i, *) 


(3-13) 


and the equation which determines x once the system (3-13) is solved is 


dx 1 

dx G„(y u . • ., yn-h x^) 


(3-14) 


It can now be shown that every solution to the system (3-13) in Do is a 
solution of the system (3-9), and conversely. Thus, we may say that the systems 
(3-9) and (3-13) are equivalent in Do. 

The solutions of the system (3-9) in n variables can be found by solving 
the n — 1 simultaneous equations (3-13) to determine yj, . . ., y«-i as functions 
of y n . And then a single equation obtained by substituting these solutions into 
equation (3-14) can be solved to determine the parametric variable x as a 
function of y n . In order to emphasize the connection between the autonomous 
system of n equations (3-9) and the system of n — 1 equations (3-13), the 
former system is sometimes denoted symbolically by 


dyi = dy 2 _ dy n , 

. sx ax 

1 ^2 


or, when finding the variable x is not important, by 

dy\_dy 2 _ _ dyn 

Gt G2 G n 
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Thus, for example, consider the autonomous system of three equations 

^=Gi(y u y 2 ,y 3 ) = myty 2 y% 


^=G 2 (y u y 2 ,y 3 ) 


(3-15) 


^ = G 3 (yi , y 2 , y 3 ) = yiy2y 3 


Upon putting i = y 3 we get 


dy i 




Gi(y!,y 2 ,5) = rayji 


dy 2 /s , ny 2 

- JF =e 2 ( y 1 ,y 2 ,A : ) =- — 


ac 


(3-16) 


and x is determined by the equation 


dx 1 _ 1 


dx yiy 2 £ 


(3-17) 


The system (3-16) has the solution 


mx 2 l2 my 2 / 2 

y 1 = C ,e = Cie 

c 2 

n X n 


Finally, equation (3—17) shows that x is related to x by 


x = c 2 + 


— [ x"- 1 

ClC -2 J 


exp 


mx 2 


dx 
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The interpretation of the solution of a normal system of n equations as a 
curve in an n-dimensional space is particularly useful when the system is autono- 
mous. This is the case because for autonomous systems, unlike nonautonomous 
systems in general, there is usually only a single curve passing through each 
point in this space. We can show that this is the case by applying the funda- 
mental theorem given in section 3.1 to the transformed system (3—13). This 
theorem applies to the yi, . . ., y n -i, x space in which the system (3-13) is 
defined. But in view of equation (3-11), we see that this is the same as the 
re-dimensional y t , . . y n space in which the untransformed autonomous 
system (3-9) is defined. Thus, suppose that the functions G, have continuous 
first partial derivatives with respect to all their arguments 32 at every point 
in some domain D of the n-dimensional yi, . . y n space and that there is 
no point of D where all the Gj’s simultaneously vanish. Then, because of the 
continuity of the G\ s, we can assert that for any point y?, . . ., y® of D there 
is a neighborhood Do in which at least one of the Gj’s is never equal to zero. 
Within this neighborhood we can transform the normal system (3-9) into the 
system (3-13) and apply the fundamental theorem in the region Do. Since the 
Gi s have continuous partial derivatives, we can conclude that the system (3-9) 
has precisely one solution curve in the 77-dimensional yi, . . ., y n space pass- 
ing through the point yf, . . ., y®. Now suppose the notation is chosen so that 
G n does not vanish in Do. Then G„ does not change sign in D 0 , and equation 
(3-14) shows that x is a monotonic function of x along any solution curve 
in Do. Hence, we can take x as the parametric variable for the solution curve 
instead of x. Since this is true for every point of D, we arrive at the follow- 
ing conclusion: 33 If the function G which appears in the first-order autonomous 
system 



(3-18) 


possesses continuous partial derivatives at every point of some domain D, and 
if |G(y)| does not vanish at any point of D, the system (3—18) has exactly 
one solution curve 


32 Notice that in this case the existence of the partial derivatives guarantees the continuity of the functions G*. 
33 We are using the vector notation introduced in section 3.1. 
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y=f(*, y 0 ) (3-19) 

passing through each point y 0 of D. 

Thus, we can think of the “solution vector” (3-19) as tracing out a con- 
tinuous curve in the ^-dimensional y u . . ., y n space, which passes through 
the point yo when the parametric variable takes on some value, say xo. Such 
“solution curves” are called integral curves of the system (3-18). These ideas 
are illustrated in figure 3-2 for the case where n — 3. 

Notice that in order to ensure that the system (3-18) has only one integral 
curve passing through each point of D, it is necessary to require that |G(y)|, 
the magnitude of G(y) , does not vanish at any point of D. Points where |G(y) | — 
0 are called critical points of the autonomous system (3-18). 

If yo is a critical point of the system (3-18), this system must possess the 
constant solution 

y = y 0 

which is called an equilibrium solution. 

For example, it may be verified by inspection that the solutions to the 
autonomous system 



Figure 3-2. — Relation between solution vector and integral curve of differential equation. 
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dy i 


with n> m, are 


dy 2 


yi = Cie mx 


y 2 = C 2 e" x 


The point (0, 0) is a critical point. The integral curves of this system are the 
loci of points 

(y 2 ) m =^(yi)" 

where k=cf\c ?. These curves are shown for the case where m and n are 
positive integers in figure 3-3. It can be seen from this figure that there is 



FIGURE 3-3. — Integral curves for sample problem. 
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exactly one integral curve passing through each point except the critical point 

( 0 , 0 ). 

A vector function which is defined at each point of an n-dimensional 
space is said to constitute a “vector field.” Since a unique vector-valued 
function G(y) corresponds to each autonomous system (3-18) and conversely, 
we can say that each first-order autonomous system is characterized by 
its vector field. 

Consider a portion of an integral curve of the system (3-18) which passes 
through a point y which is not a critical point and which lies in a domain D 
where G(y) is continuously differentiable. Let S denote the arc length meas- 
ured in some definite direction along this curve. Then 


{dSy* ~{dy x ) 2 + (dy 2 ) 2 + . . . + (dy„) 2 


or 


dS — ±dx 


dydy 
dx dx 


But equation (3-18) shows that 


dS 

dx 


=± VgTg=± 


|G| 


Hence, takingS as the independent variable in equation (3-18) shows that 


dy _ dy dx _ G 
dS~ dx d5 _± ]G| 

However, the vector dy/dS is the unit tangent vector to the integral curve 
passing through y; and since there is an integral curve passing through each 
point of D, we conclude that the vector field G(y) is tangent to the integral 
curves of the system (3-18) , except possibly at the critical points. 

When n — 3, we can imagine that the variable x represents time and the 
autonomous system (3-18) represents the steady flow of a fluid in space. At 
each point y in a certain region of this space the vector G(y) describes the 
velocity of the fluid at that point in both magnitude and direction. The flow 
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is steady because its velocity depends only on position and does not vary with 
time. The integral curve passing through the point yo may then be interpreted 
as the streamline followed by all the fluid particles which have passed through 
the point yo. The velocity field is tangent to the streamlines at every point ex- 
cept at the critical points which correspond to “stagnation points” where the 
fluid velocity vanishes. At these points, it is possible for two or more stream- 
lines to meet or for a given streamline to abruptly change direction. 

By again applying the results obtained in section 3.1 to the autonomous 
system (3—9) by means of the transformed system (3—13), we arrive at the 
following conclusions: Suppose that D is a region of yi, . . . , y n space which 
contains no critical points of the system (3-9) and that G has continuous partial 
derivatives at every point of D. Then a function H(y) of the vector variable 
y in D which does not depend explicitly on x is a solution of the first-order 
linear partial differential equation 

AH 

C,(y)f-+. . . + G n (y)p-=0 (3-20) 

dyi oy n 

if, and only if, it is an integral of the autonomous system (3-9). If, in addition, 
the region D is sufficiently small, the system (3-9) possesses precisely n — 1 
functionally independent integrals which are independent of x. This shows that 
equation (3-20) possesses n — 1 functionally independent solutions. 

The level surfaces of these integrals are hypersurfaces in the n-dimensional 
yi, . . . , y n space. The intersection of any n-1 of these hypersurfaces (no 
two of which correspond to the same functionally independent integral) is 
an integral curve of the system (3—9) in this space. 

The partial differential equation can be written more compactly by using 
the vector notation introduced in section 3.1 to obtain 

G VH = 0 

In this form the equation has an immediate geometrical interpretation. As in 
three-dimensional space, the vector V// is perpendicular to the level surfaces 
of the function H. Hence, the differential equation (3-20) merely states that the 
vector field G is everywhere tangent to the level surfaces of its solutions. This 
is consistent with the facts that the integral curves of the system (3-9) lie on 
these level surfaces and that the vector field G is tangent to these integral 
curves. 
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3.3 SOLUTIONS OF THE FIRST-ORDER LINEAR PARTIAL DIFFERENTIAL EQUATION 

Since equation (3-20) cannot have n functionally independent solutions, 
there must exist for any n solutions Hi, . . . , H n of equation (3-20) a non- 
constant function a) such that 


( o{Hi,H 2 , . . H n )= 0 


for all yi, . . . , y n in some region D. Now it can be shown (ref. 6) that this 
equation can always be solved for H n to obtain 


Hn = F(Hi, Hi, . . ,,Hn-l) 

provided that Hi, . . H n -\ are functionally independent. But since Hi , . . ., 
H n were any solutions of equation (3-20), this shows that every solution of 
equation (3-20) can be expressed as a function of any n — 1 functionally inde- 
pendent solutions. Conversely, it is easy to verify by direct substitution that if 
Hi, Hz, . . H n - 1 are any solutions of equation (3-20) and F is any continu- 
ously differentiable function of Hi, . . ., H n -i, then F(Hi, Hz, ■ ■ ., H n - 1 ) is 
also a solution of equation (3 _ 20). Thus, we have shown that H is a solution of 
the linear partial differential equation (3-20) if, and only if, there is a function 
F such that 


H(y)=F(Hi(y), . . .,H n - i(y)) 


where Hi(y), . . . , H n -i (y) are any n — 1 functionally independent integrals 
of the autonomous system (3-9) which do not depend on x. 

Thus, the most general solution of the partial differential equation (3-20) 
can be found if the system (3-9) of ordinary differential equations can be solved. 
The differential equations (3-9) are therefore called the characteristic equations 
of the partial differential equation (3-20) and their integral curves are called 
characteristic curves of the partial differential equation. 

For example, the characteristic equations of the partial differential 
equation 


(y.-r 3 ) ff=o 


(3-21) 
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are the equations of the system (3-8). But we have shown that this system has 
the two functionally independent integrals 


Hi = yi + y 2 + y3 


#2— yj’ + yl + yl 


Thus, for every differentiable function F, the function 


H=F{H U H 2 ) 


must be a solution of equation (3-21). This can easily be verified by substitut- 
ing the relations 


dH dF | 0 dF 

dyj-BHx Zyj dH 2 


for y= 1,2,3 


into equation (3-21). 

The level surfaces of the integral Hi are planes, and the level surfaces 
of the integral H 2 are the surfaces of spheres. The intersections of the surfaces 
H\ — constant with the surfaces H 2 = constant are therefore circles. Hence, 
the characteristic curves of equation (3-21) are circles. 


3.4 QUASI-UNEAR PARTIAL DIFFERENTIAL EQUATIONS OF THE FIRST ORDER 

There is also a close connection between the quasi-linear partial differential 
equation 


Gt (y u 




+ G„_i(yi, 


• ,yn-lj ') 


dy n - 1 


= G„(yi, . . . , y n -i,f) (3-22) 

and the linear partial differential equation (3-20) and, therefore, also between 
this equation and the autonomous system (3-9). In order to show this, let 

H = h(y u . . . ,y n ) 
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be any solution of equation (3-20) such that dH/dy n is not identically zero. 34 
Then the implicit function theorem shows that there is a neighborhood D of 


any point yo for which 

r^° 

(3-23) 


dy„ 

such that the equation 

h{yi, . . .,y n ) = ci 

(3-24) 

has a solution 

yn=f(yu • • .,y«-i,ci) 

(3-25) 


in D. This means that when equation (3-25) is substituted into equation (3-24), 
the resulting expression is identically constant for all values of yi, . . y n -i- 
Hence, upon differentiating this expression successively with respect to 
yi, . . ., y«-i, we obtain 


BH + dH_ Bf_ 

dyi dy n dyi 


JIL+m §L o 

dy n -i dy„ dy n - 1 

But when these relations are substituted into equation (3-20) , we find that 


dH 

dyn 




= 0 


And, in view of equation (3-23), this shows that equation (3-25) is a solution 
of equation (3-22). 


34 This means that H is not independent of y n . 
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Now there are n — 1 functionally independent integrals Hi, . . ., H n -\ 
of the system (3-9), which are therefore functionally independent solutions 
of equation (3-20). If there is some point y at which these integrals all satisfy 
condition (3-23) , we can find at least n — 1 functions 

y»=fAyi, . . .,y n -i,c,) 

' (3-26) 

y n =f n -i(yi, . . .,y„-i,C n -i) < 

which, respectively, satisfy the equations 

Hi = hi(y t , . . y n )=ci ' 

(3-27) 

Hn-i = hn-i (yu • . •■>yn) = C n - 1 .< 


in some neighborhood D of the point y. Therefore, each of the functions (3-26) 
is a solution to equation (3-22) in D. Since equations (3-26) determine explic- 
itly the same surfaces as equations (3-27) determine implicitly and since 
these surfaces determine the integral curves of the system (3-9), the equations 
of this system are also called the characteristic equations of equation (3-22). 

For example, the partial differential equation 

ln~f> £+</-*) l£=rt-r* (S- 28 ) 

oyi oy 2 

has the same characteristic equations as the linear equation in the preceding 
example, namely equations (3-8). These equations, as we have seen, have the 
two functionally independent integrals 
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Ht = yi+y 2 +y3 


H 2 =yi+yi+yi 


And equation ( 3 - 28 ) therefore has the solutions 


/i = c,-(yi+y 2 ) 
ft—± Vcf— (yf+yf) 
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Elementary Methods for Second-Order Equations 


In this chapter several methods for obtaining solutions to second-order 
differential equations are presented. Even though each of these methods applies 
to a relatively narrow class of equations, they are still of sufficient generality 
to be useful in practice. However, unlike the methods presented in chapter 2 
for solving first-order equations, most of the techniques presented in this chap- 
ter will not by themselves yield solutions to equations. They are, in fact, 
methods for transforming certain types of equations into simpler equations 
whose solutions can hopefully be found by known methods. We have already 
found a number of methods for solving first-order equations. The only other 
general class of equations for which it is possible to find numerous solutions 
is the class of linear equations introduced in chapter 1. Therefore, in section 
4.1 we present some techniques for reducing second-order equations to equa- 
tions of the first order. And in section 4.2 we present techniques which trans- 
form certain types of nonlinear equations into linear equations. Finally, in 
section 4.3 we present a number of unrelated methods. Although many of the 
methods presented in this chapter are applicable, with some obvious modifi- 
cation, to equations of higher order, we shall, for simplicity, limit the dis- 
cussion to second-order equations. 

As indicated in chapter 1, the most general second-order differential 
equation is an equation of the form 

0 <«> 

whereas, the most general normal second-order equation is of the form 
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<Py _ r (dy \ 

dx 2 G \dx' y,x ) 


(4-2) 


A general solution of equation (4-1) or of equation (4-2), if it exists, will 
involve two arbitrary constants. As in the case of first-order equations, there 
may be any number of general solutions to these equations. In addition, equa- 
tion (4-1) may possess singular solutions. These solutions are discussed in 
section 1.5. The methods presented in that section should be used to find any 
singular solutions which may be present since such solutions can be important 
in physical problems. 

Finally, the second-order linear equation is an equation of the form 

P°^^ + P^ x ^ + P 2 ^y =P3 ^ ( 4 "3) 

We shall assume the coefficients are continuous. Upon dividing through by 
the leading coefficient po(x ) , we obtain the (essentially) normal form 

^+ p (*)^+<7(*)y=r(*) (4-4) 

This differential equation is equivalent to equation (4-3) in any interval in 
which po(x) does not vanish. 

4.1 EQUATIONS WHICH ARE REDUCIBLE TO FIRST-ORDER EQUATIONS 

In this section a number of techniques, which can be used to reduce 
certain types of equations to equations of the first order, are described. 

4.1.1 Dependent Variable Missing 

When a second-order differential equation does not explicitly contain the 
dependent variable y, it must be of the form 



& ^ = o 

dx 2 ' dx' ) 


(4-5) 


But this equation can be written as a first-order differential equation 
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= 0 


for the quantity p defined by 



(4-6) 


It may or may not be possible to solve this first-order equation in any given 
case. However, suppose that it can be solved and that its integral is 

f(x, p) = constant = C\ (4-7) 

At this stage, there are two possible ways of proceeding, depending on whether 
it is easier to find an explicit formula for p as a function of x which solves 35 
equation (4-7) or whether it is easier to find an explicit formula for x as a 
function of p. 

First, suppose that the former case occurs. Thus, we can find an explicit 
formula 


p = g(x, Ci) 


for a solution to equation (4-7). Then, in view of equation (4-6), this equation 
can be immediately integrated to obtain a general solution 



C\)dx + C 2 


of equation (4-5). 

Next suppose that it is easier to find an explicit formula 

x = h(p, Ci) (4-8) 

for a solution to equation (4-7). Substituting equation (4-8) into the relation 

dydp 

P dpdx 


35 See section 1.1. 
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shows that 


Hence, 


dy _ dx_ dh 
dp P dp ** dp 



|dp + c 2 


And, upon integrating this by parts, we obtain the equation 

y=ph(p, Ci) —J h(p, Ci)dp + C 2 (4-9) 

which together with equation (4—8) determines the solution y of equation (4—5) 
parametrically as a function of x (with p being the parametric variable). This 
parametric function involves two arbitrary constants. 

For example, the equation of the curve followed in the pursuit of a prey 
which moves along the y-axis (ref. 9) is 

KVT+ y*=(a-x)y" 

where K is the ratio of the velocity of the prey to the velocity of the pursuer. 
Put y' = p to obtain the first-order separable equation 

K Vl+p 2 = 

which can easily be solved (section 2.3) to obtain 

(a-*)-] 

If K 5 ^ 1, this equation becomes, upon integration, 

5 '=T [TTR - x)K " + C,(K-i ) +C* 

(If K= 1, the integration yields a logarithmic term.) 
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4.1.2 Independent Variable Missing 


The general form of the second-order equation in which the independent 
variable does not appear explicitly is 



We again define p by 



Then substituting this together with the relation 

d l y dp dp dy dp 
dx 2 dx dy dx dy y 


(4-10) 

(4-11) 


(4-12) 


into equation (4-10) shows that p is determined by the first-order differential 
equation 


F ($pp,p,y)=o 

Suppose that this equation can be solved and its integral is 

f(p , y) = Ci = constant (4-13) 

It is again possible to proceed in two different ways. First, suppose that it is 
easier to find a formula for the solution p = g(y, C i) of equation (4-13) for p 
as a function of y. Then combining this with equation (4-11) and integrating 
provides a general solution 


J; 


dy 


g(y , C,) 


C> = x 


of equation (4-10). 

On the other hand, if it is easier to find a formula 

y=h(p, Ci) 


(4-14) 


77 



DIFFERENTIAL EQUATIONS 


for the solution of equation (4-13) which expresses y as a function of p, equation 
(4-12) shows that 


dx_ 1 dy _ \ dh 
dp - p dp p dp 

But this equation can be integrated by parts to obtain the equation 


x=C 2 +~ h(p, 

P 



h(p, Ci)dp 


(4-15) 


which, together with equation (4-14), determines the solution to equation (4-10) 
parametrically. 

For example, the differential equation 


y 


d 2 y 

dx 1 


+ v 



(4-16) 


occurs in the field of fluid dynamics. 36 Substituting equations (4-11) and (4-12) 
into this equation yields the first-order equation 


r p^ + „ p! = o 

Since this equation is separable, it can be integrated to obtain p = Cdy v . 
Hence, combining this with equation (4-11) and integrating shows that 


yV 


+ 1 


v+1 


C \X + C'i 


is a solution of equation (4-16). 

Notice that, when A , B, C, and D are constants, the differential equation 

y" =A + By+Cy 2 + Dy 3 


“More specifically, it governs the velocity field in a boundary layer in the neighborhood of the separation point. 
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does not explicitly contain the dependent variable x. When the procedure 
just described is applied to this equation, we encounter integrals which usually 
cannot be evaluated in terms of elementary functions. These integrals, however, 
can always be expressed in terms of elliptic functions. The differential equation 
is therefore called the elliptic equation. For more details concerning the proce- 
dures involved in evaluating these integrals and the properties of elliptic 
functions, the reader is referred to reference 10 (chapter 6 and section 11 of 
chapter 7). 

Finally, notice that a linear homogeneous equation does not contain the 
independent variable explicitly if, and only if, it has constant coefficients. 37 

4.1.3 Homogeneous Equations 

The general definition of a homogeneous function of degree k has been 
given in section 2.3. We shall now show that if the second-order differential 
equation (4-1) has certain homogeneity properties, it can be solved by ele- 
mentary means. 

4. 1.3.1 Equations homogeneous in the dependent variable and its deriva- 
tives.— First, suppose that equation (4-1) is homogeneous of degree k in the 
variables y, y', and y"; that is, it satisfies the homogeneity condition 

F(ty",ty', ty, x) = t k F{y" ,y\y , *) (4-17) 


for any number t. Hence, if we put t= 1/y, equation (4-17) becomes 

which shows that the differential equation (4-1) can be written in the form 

f ( 2 p ; y ,1 ’*) = 0 (4 ' 18) 

by factoring out y k . In order to reduce the order of this equation, define the 
function u(*) by 


37 We assume that the reader knows how to solve linear equations with constant coefficients. 
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u= 


y_ 

y 


(4-19) 


Then 


y" 

— =u' + u 2 

y 

Upon substituting this, together with equation (4-19), into equation (4-18) 
we obtain the first-order equation 

F(u' + u 2 ,u, 1,*)=0 (4-20) 

Suppose this equation can be solved and its integral is 

f{u,x)=Ci (4-21) 

Then solve equation (4-21) for u as a function of x to obtain 

u=g(x,C 1 ) (4-22) 

But substituting this into equation (4-19) and integrating shows that the 
original equation (4-1) has the solution 


In y= 



C\)dx-\-Ci 


(4-23) 


If the differential equation (4-J) is linear, the first-order equation (4-20) 
obtained by this method will be a Riccati equation which will usually be 
more difficult to solve than the original equation. 

As an example of the method, consider the equation 

F(y", y' , y, x) = yy"-y ' 2 + y 2 x=0 (4-24) 


and replace y, y', and y" by ty, ty ' , and ty" , respectively, to get 

(ty)(ty")~ {ty') 2 + ( ty) 2 x= t 2 (yy" -y' 2 + y 2 x)=t 2 F(y" , y ' , y, x) 
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This shows that F is homogeneous of degree 2 in y and its derivative. Hence, 
upon introducing the transformation (4-19), the differential equation (4-24) 
becomes 


u' + u 2 — u 2 + x = u' + x = 0 


But this equation can be immediately integrated to obtain 

u+~2=Ci 

Hence, 

In y~J {Ci-^\dx-¥C 2 

The solution to equation (4-24) is, therefore, given by 

y=C 3 exp 


where we have put C 3 = e c *. 

4 . 1 . 3.2 Isobaric equation. — The isobaric equation of the first order in 
normal form was encountered in section 2.3. In the general case, the function 

p/d m y d m ~ l y dy 

\dx m ’ dx m ~ »' ' ’ 'dx ,y,x 

is said to be isobaric if there exist numbers k and l such that 



d m y 

dx mJ 


fl—m+l 


d m ~ 1 y 

dx m ~ v 



dy 

dx' 



(4-25) 


for all values of t. And the differential equation 

F[y( m) ,yl m - . . . ,/,y, x]=0 
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is said to be isobaric if the function F is isobaric. It is easy to see that the dif- 
ferential equation (2-17) is of this type (for m=l). For the second-order dif- 
ferential equation 

F(y'\y',y,x) = 0 (4-26) 

the condition (4-25) becomes 

F(t l ~ 2 y" , t l ~ l y', t l y, tx) = t k F(y", y\ y, x) (4-27) 

Thus, in particular, upon replacing t by 1/x, equation (4-27) becomes 

jh W'* y* y* *) =0 

And this shows that equation (4-26) can be written in the form 

j)-° (4-28) 

However, when we introduce the new variables 

u = x~ l y (4-29) 

and 

^ = ln r (4-30) 

into this equation, we obtain an equation 

F[;p+(2Z-1) u, lj =0 (4-31) 

in which the independent variable £ does not appear explicitly. Since this 
equation can always be reduced to a first-order equation by the methods of 
section 4.1.2, it follows that the isobaric equation of the second order can also 
be reduced to such an equation. 
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The homogeneous linear isobaric equation of the second order is an equa- 
tion of the form 

x 2 y ' ' + poxy' + q 0 y=0 (4-32 ) 

where po and qo are constants. It is known as the homogeneous Euler equation. 
Since this equation satisfies condition (4-27) with l=k= 0, the change of 
variables u = y, £=ln x will transform it into the equation 

^£2+ (Po — 1) ^ + <?ott = 0 

The independent variable does not appear explicitly in this equation and the 
coefficients are constants. 38 

This equation, therefore, has two solutions of the form u = e Cf , one for each 
of the roots of the equation. 

C 2 +{p 0 -l)C + <7o = 0 

It is easy to see that the general Euler equation 

x 2 y" + poxy' + qoy= r(x) 

can also be transformed into an equation with constant coefficients by the 
change of variables 

u = y g — In x 

A less elementary example of an isobaric equation is provided by the 
equation 

F(y" ,y' ,y,x)=yy" - fr' 2 +(f) =0 (4-33) 

It is easy to verify that, when the function F is given by equation (4-33), the 
condition (4—27) will be satisfied for any value of / provided that k = 2l — 2. 
Hence, upon introducing the change of variables 

38 Which is as it should be since the equation is linear. 
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£=ln*j 


the differential equation (4—33) is transformed into the equation 



(4-34) 


(4-35) 


Since in this case any value of l can be used, we choose its value to 
simplify equation (4—35). This is accomplished by setting 1 = 2, whereupon 
equation (4-35) becomes 


u 


d 2 u 3 (duy 

de 4Uf/ 


But this is a special case of equation (4—16) with v=— 3/4 and, therefore, has 
the solution 


4 u'i* = C 1 Z + C 2 


Hence, by using equations (4—34) and (4—35), the solution to equation (4—33) 
is found to be 


, ( Cilnx+Cz \ 4 
y=X { 4 j 

4.1.4 Method of Variation of Parameters 

This method applies only to linear equations. We have seen in chapter 1 
that the homogeneous equation (associated with eq. (4—4)) 


y" +p(x)y' + q(x)y= 0 (4-36) 

possesses exactly two linearly independent solutions in any interval in which its 
coefficients are continuous, and that the general solution of the nonhomoge- 
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neous equation (4—4) in this interval is the sum of any particular solution of that 
equation plus an arbitrary linear combination of two linearly independent 
solutions of (4-36). 

We now suppose that by some means a nontrivial solution of the asso- 
ciated homogeneous equation (4-36) has been found. Substituting 

y=vy H 


(where v is a function to be determined subsequently) into equation (4-4) and 
using the fact that yH satisfies the homogeneous equation (4-36) to simplify 
the result shows that 


yHv"+ (2y' H +py H )v' = r 

But this is a first-order linear equation for v' . It can therefore be solved by the 
methods of section 2.3, to obtain 

yj 1 v'e^ pdz = J r(x)yH(x)e^ Hx)dx dx + Ci 


When this equation is solved for v' and integrated, we obtain an expression for 
v (in terms of known functions) which contains two arbitrary constants. Hence, 
substituting this expression into y=ytfv gives the general solution to equation 
(4-4). 

For example, it is easy to find by inspection that yH — x is a homogeneous 
solution of the equation 


,.n 2x , , 2 **+.3* 

y ~ l+x ,y + 1 +x *y- 1+x , 


(4-37) 


Hence, substituting y=xv into this equation shows that v' satisfies the first- 
order linear equation 


2 , _ * 2 + 3 

x(x 2 +l) V x 2 + l 


which can easily be integrated to obtain 
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x 2 v' 


X - 3 


x 2 +l x 2 +l 


■ c i 


But solving this equation for v' and integrating shows that v is given by 

V= ^ + Cl { x ~l) +Ci 

Therefore, the general solution to equation (4-37) is 


y=xv = ^-+Ci (x 2 — 1) + C 2 x 


4.1.5 Equations Invariant Under a Transformation Group 

The group-theory method for reducing the order of equations is a generali- 
zation of all the techniques previously discussed in this section. However, this 
technique cannot be applied routinely since, as will be seen, it is necessary to 
find a group under which a given equation is invariant and there is no con- 
structive procedure for accomplishing this. The method is more useful for 
working backwards to find general methods for solving particular classes of 
equations by starting with a given group and testing equations for invariance 
under this group in much the same way that equations are tested to see if they 
are homogeneous. 

A single parameter Lie group 39 in two dimensions is a family of coordinate 
transformations 


xi = f(x,y; a) 
yi=g(x,y; a) 


(4-38) 


in which the members of the family are individuated by the values of the param- 
eter a. In addition, the family must contain an identity transformation which, 
without loss of generality, we can associate with the value a = 0. Thus 


39 Various continuity hypotheses, which will not be presented here, are required for a rigorous treatment of the 
theory. 
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x = f(x,y; 0) 

y=g(x,y; 0) 

Consider an infinitely differentiable but otherwise arbitrary function of the 
coordinates F(x, y). If the point ( x , y) is transformed by the group (4—38) into 
the point (*i, yi) for a particular value of a, the value of F at (x\,y\) is given 
by the Taylor series whose leading terms are 


F( I „y,) = F(x,y) + a(£^+^^) <i _ # F(x,y) + . . . 

= FU,y) + a^(r,y) ^+rj(x,y) f^F(x,y) + . . . 


(4-39) 


where we have put 


and 7l(x ' y) ~(^) 

In particular, if F is taken successively as x and y, then 

Xi = x-\-at;{x, y) + . . . 
yi = y+anr){x, y) + . . 


(4-40) 


(4-41) 


When a is infinitesimally small, the transformation (4—41) differs only infin- 
itesimally from the identity transformations; therefore, £ and r\ are referred to 
as the infinitesimal transformations of the group. The operator 

= £(*,y)|^(*,y)|^ 

is called the infinitesimal operator of the group. By using this operator notation, 
the Taylor series (4—39) can be expressed as 


F(xu yi) = F(x, y) + aUF -\--^a 2 U 2 F . . . 
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But this series can be formally summed to obtain 

F(x i, y x ) = e aU F(x, y) 

In the particular cases where F is taken to be the coordinates themselves, this 
becomes 

Xi = e aU x 

y i = e aU y 


These equations are equivalent to the equations (4—38) which define the original 
finite transformation. This shows that the finite transformation is completely 
determined by the infinitesimal transformations. 

In order to illustrate these ideas, consider the magnification group 


xi = f(x, y; a) = e a x 
y x = g(x, y; a) = e a y 


(4-42) 


It is apparent that we obtain the identity transformation when a = 0. Inserting 
the equations of this group into equations (4—40) shows that the infinitesimal 
transformations of this group are 


(£L„ = * 

and therefore that the infinitesimal operator is 

TT 3 , 3 

U = x~ — hy— 
dx dy 

Applying this operator repeatedly to the coordinates yields 
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Ux=x Uy=y 

U 2 x—x U 2 y=y 

U z x=x U 3 y=y 


Hence, 


X\ = x + ax a 2 x +^j a z x + 


yi = y+ay+2 a 2 y+^a 3 y+. . . 

And upon summing these series, we obtain the original group 


x i = e a x y 1 = e a y 


The differential equation 


G(x,y,y',y") = 0 (4-43) 

is said to be invariant under the group (4-38) if introducing the new variables 
X\ and yi, given by equations (4-38), into this equation leads to the equation 


G(xi,yi,y[,y[') =0 


This means that the change of variable given in equations (4-38) does not alter 
the form of the differential equation (4-43). 

For example, the differential equation 

C(*,y,/,y") -*y"-*-(£,y’) = 0 (4-44) 


is invariant under the magnification group (4—42) since it follows from equations 
(4-42) that 
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y_yi dy dy\ d 2 y _ d 2 y\ 

x xi’ dx dxd X dx 2 Xl dx\ 


We state without proof that a necessary condition for the differential 
equation (4—43) to be invariant under the group (4-38) is that 


U 0 G = 0 


(4-45) 


where the operator Uo is defined by 


u, -fh +1 >h +l W' 


dy' 


with 


and 




= ^2L^^"dL_,£i 

^ dx 2 ^ dx ^ dx 2 

In performing the partial derivatives we treat x, y, y ' , and y" as independent 
variables; whereas, in performing the total differentiations we treat these 
quantities as functions of x. 

For example, since we have shown that 

£ = * r) = y 

for the magnification group (4-42), it follows that 
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for this group. The operator Uo is, therefore, given by 

Uo = xf + yf (4-46) 

dx dy 

But since 



applying the operator (4-46) to equation (4-44) shows that 


nr- dF ^ dF n 

U 0 G — -y - — K y — — 0 
dy dy 


which is consistent with the result obtained in the previous example. 
We have shown in chapter 3 that the system 40 


dx _ dy _ dp 

Hx,y) r)(x,y) £(x,y,p ) ' 4 47 ' 

has two functionally independent integrals, u(x,y) and v(x,y,p), one of which 
can be chosen independent of p. The following result allows us to apply the 
ideas of group theory to reduce the order of a differential equation: If the 
differential equation (4-43) is invariant under the group (4-38), then the 
equation obtained by introducing the integrals u and v as new variables in this 
equation is of the first order. 

Thus, we have seen in the preceding examples that equation (4-44) is 
invariant under the magnification group (4-42) and that, for this group. 


i = x v = y £=0 X = 0 

Hence, the system (4—47) becomes 

dx _ dy _ dp 
x y 0 


40 We have written p = y 
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which has the two functionally independent integrals 


But since 


we see that 


u=- and v=p 
x 


dp_dv_ dv du 
y dx dx du dx 


du \jc x 2 / 




Hence, upon introducing the new variables u and v into equation (4-44), we 
get the first-order equation 


dv _ F(u, v ) 
du u — v 


It is easy to see that equation (4-44) is the general isobaric equation 
with 1=1. For this case the group-theory method is the same as the method 
given in section 4.I.3.2. In fact, all the methods of solution given up to now 
in this chapter are equivalent to the group-theory method when used in con- 
junction with certain well-known groups. 

For example, the equation with the dependent variable missing, treated 
in section 4.1.1, is invariant under the translation group 


X\=X 

yi = y+ a 


And the equation with independent variable missing, treated in section 4.1.2, 
is invariant under the translation group 

X\ = x + a 

y i = y 
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The isobaric equation with /=0, treated in section 4.1.3.2, is invariant under 
the affine group 


X\ = e a x 


y i = y 

And the homogeneous equation treated in section 4. 1.3.1 is invariant under 
the affine group 


X\ = x 


y\ = e?y 


The general linear equation discussed in section 4.1.4 is invariant under the 
nonuniform-distortion group 


X\ = x 

y i = y+a(f){x) 

where <j> is any homogeneous solution of the equation. 

In each of these cases the group-theory method is entirely equivalent to 
the method already introduced. 

4.1.6 Exact Equations of the Second Order 

We introduced the exact equation of the first order in section 2.2. We 
shall now extend the ideas presented therein to the second-order differential 
equation 


F(y",y',y,x)= 0 (4-48) 

We say that the differential equation (4—48) is exact if there exists a function 
<l>(y', y, x) such that 


(4-49) 


/r = # 
dx 
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Suppose that such a function <j> exists. Then every solution of equation 
(4—48) must also be a solution of the first-order differential equation 

<j>(y',y,x) = C 1 (4-50) 

for some constant C\. Hence, the problem of solving equation (4-48) can be 
replaced by the problem of solving the first-order equation (4-50). We can 
therefore say that equation (4-48) has been reduced to the first-order equation 
(4-50). Upon recalling the definition given in section 1.3 we see that <}> is a 
(first) integral of equation (4-48). 

It follows from the chain rule that 


d(f) d<fr 

dx dx 


(hf> 

dy 


M y " 

dy' y 


(4-51) 


where the variables x, y, and y' are treated as independent in forming the 
partial derivatives. Hence, it follows from equation (4-49) and the fact that (f> 
does not depend on y" that the differential equation (4-48) must be an 
equation of the form 

F(y'',y , ,y,x)=f(x,y,y')y' , +g(x,y,y')=0 (4-52) 

where 41 

f(x,y,p) = <f>p (4-53) 


g(x,y,p) = (f> x + P(t>v (4-54) 

It is now easy to verify by substituting in equations (4~ 53) and (4 _ 54) that/ 
and g satisfy the conditions 


fxx + 2 pfxy + p 2 fy V gxp + Pgyp gy 
fxp + Pfyp “b 2/ y = gpp 


(4-55) 


41 As usual, we have put p = y. The subscript notation for partial derivatives is being used. 
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We have therefore shown that, if the general second-order equation 
(4—48) is exact, it must be of the first degree and, hence, of the form (4-52). 
And the functions / and g in this equation must satisfy conditions (4-55). 
Then the equation will have a first integral <f> (and therefore can be reduced 
to a first-order equation) which can be found by integrating equation (4-53) 
with respect to p (at constant x and y) to obtain 

<f>= j f(x,y,p)dp + h(x,y) (4-56) 

where h is an arbitrary function which arises from the integration. Now if 
the differential equation is exact, it will always be possible to determine the 
function h so that, when equation (4-56) is substituted into equation (4-54), 
the latter equation will be identically satisfied. 

In order to illustrate the method, consider the equation 


F(y",y',x) =xyy"-\-xy' 2 + yy’ = 0 (4-57) 


For this equation the functions / and g (in eq. (4-52)) are given by 


f=xy 


g= xy' 2 + yy' = xp 2 + yp 


(4-58) 


It is easy to verify that these relations satisfy conditions (4 -55). Hence, equa- 
tion (4-57) is an exact equation. 

Substituting the first of equations (4—58) into equation (4-56) shows that 

<f)=pxy+h(x, y) 

and substituting this and the second equation (4-58) into equation (4-54) 
shows that 


xp 2 + yp = yp + h x + p 2 x + phy 


But this equation is satisfied by taking h = constant. Hence, equation (4-57) 
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has the first integral 

<j) = pxy=y'xy=C l (4-59) 

where C\ is a constant. But this first-order equation is separable and can be 
integrated immediately to obtain the general solution 

Y = Ci lnx + C 2 

of equation (4-57). 

Sometimes a first integral of an exact equation can be found by inspection 
simply by collecting terms and writing the equation in the form (4-49). Thus, 
since 

jL yy' =y'2_|_yy' 

the differential equation (4-57) can also be written as 

F(y',y,x)=x ^ (yy ' ) + (yy' ) =0 

or 

F(y\ y, x )=-^ x yy ' = 0 

Hence, 

<f) = xyy' 

which is the same as equation (4-59). 

The general linear equation (4-3) is of the form (4-52) with 

f(x, y, y') = Po(x) 

and 

g(x , y, y') = pi (x)y' + p 2 (x)y - p 3 (x) 
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The second condition (4-55) for exactness is automatically satisfied; and the 
first condition is satisfied if, and only if, 


(fipo 

dx 2 


dpi 

dx 


+ p 2 — 0 


(4-60) 


And when this is the case, equation (4—56) becomes 

<f> = po(x)p + h(x, y) 

But substituting this into equation (4-54) shows that 

/ dpo , dh\ dh 

\~d^- p ' + Yy) P = Pty ~Yx ~ P3 

Hence, it follows from equation (4-60) that 





+ P 3 — 0 


It is easy to see that this equation will be satisfied when h is given by 


*=-(£- 


pi 



p 3 (x)dx 


Hence, we have shown that the first integral of the general, linear, exact 
equation is 


(f> = po(x)y’ “(“ft? “P») y~ J Ps(x)dx= Ci 


This is a linear first-order equation and can, therefore, be solved by the methods 
of section 2.3. 

Just as in the case of first-order equations, we can sometimes find an 
integrating factor rj(x, y, y') for an equation of the form (4-52) which is not 
exact. Thus, a function rj(y', y, x) is an integrating factor of equation (4-52) if 
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7)F — 0 


is an exact equation. 

For example, Liouville’s equation 

y" + g{y)y' 2 +f(x)y' = o 


is not exact; but, upon multiplying through by 17 — 1 /y', we obtain the equation 

y,y" +g(y)y' +f(*) = 0 (4-61) 


which can be written as 


[In / + K(y) +*(*)] =0 

where Y(y) = / g(y)dy and X(x) = f f(x)dx. 

Equation (4-61) is therefore exact and its first integral is 


<f> = In y' + Y + X=C 0 


But, upon putting C i = e c o, we obtain the separable equation 

y' e Y = Cie~ x 

which can be immediately integrated to obtain the general solution 



/ 


e~ x(x ^dx + 


C 2 


to Liouville’s equation. 

Let po(x), pi(x), and p 2 (x) be the coefficients of the linear homogeneous 
equation 
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Po/'+Piy' +p 2 y=0 (4-62) 

and let r)(x) be a nontrivial solution of the equation 

Por)" + (2p' 0 — pi)r]’ + (po _ p'x +P2)i? = 0 (4-63) 

which can also be written as 

(ypo)"— (t?Pi)' + t?P2 = 0 (4-64) 

Since equation (4—60) is a necessary and sufficient condition that equation 
(4—3) be exact, equation (4-64) shows that the equation 

(iiPo)/+ (i7Pi)y' + (i7P2)y=o 

is exact. But this implies that 7) is an integrating factor for equation (4-62). 
The linear homogeneous equation (4-63) for the integrating factor rj is known 
as the adjoint equation of equation (4 -62). 

4.2 EQUATIONS WHICH ARE EQUIVALENT TO A LINEAR EQUATION 

4.2.1 Equations Which Can Be Transformed Into a Third-Order Linear Equation 

In section 2.4, we have seen that the first-order Riccati equation could 
be transformed into a second-order linear equation by the change of variable 


u = e 


- J rWh(x)dx 


(4-65) 


where h(x ) is not identically zero in the interval of interest. 

Since linear equations are generally much easier to solve than nonlinear 
equations, the additional complexity incurred by raising the order of the equa- 
tion is often justified. We shall now show that the change of variable (4-65) 
transforms the second-order nonlinear equation 


y"+p(x)y"-3h(x)yy' =f(x) +g(x)y+ [A'(x) +p(x)h(x)]y 2 — A 2 (*)y J 

IE 

(4-66) 
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where p(x), h(x), and g(x ) can be any functions of x, into a linear equation 
of the third order. To this end we differentiate equation (4-65) three times in 
succession to obtain 


u' = — yh{x)u (4-67) 

u" = (y 2 h 2 — y'h + yh')u (4—68) 

u"' =- h(y" + 3hyy' +^-y'+^- y-Sfh’ + h 2 y^ju (4-69) 

Substituting for y" from equation (4-66) into equation (4-69) gives 

u'''=-h[^- P y+(^+gy+f+(ph-2h')fy 

And using equation (4-68) to eliminate y' in this equation gives 

u,, ' + ( p- x) a " +v “ =_ ^T +g_ ^ +p f) 

Hence, it follows from equation (4-67) that u satisfies the third-order linear 
equation 

u'" + Pl (x)u" + qi (x)u' + ri (x)u = 0 (4-70) 


where 


Pi(x)=P~ 



, x h", 0 h' 2 , h' 

g 1 U)=X + ^ _ 2 X +p X 


ri (x)-hf 
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For example, consider the equation 


y" + 3yy' + y 3 = 0 (4-71) 

This equation is of the form (4-66), with p=f=g— 0 and h = — 1. Hence, 
Pi — Q\ — n = 0, and equation (4-70) becomes 

u'" = 0 


But this equation has the solution 

u = C o + CiX + C 2 x 2 

which can be substituted into equation (4-67) to show that equation (4-71) 
has the general solution 


2jc+6i 

y x 2 + b\x + bo 

where we have put 61 = C1/C2 and bo = C 0 IC 2 . 


4.2.2 Equations Which Are Equivalent to a Second-Order Linear Equation 


We shall now consider a class of second-order nonlinear equations whose 
solutions can be expressed in terms of the solutions of second-order linear 
equations. The work on this problem began with Painleve (ref. 11) and was 
carried on by Herbst (ref. 12), Gergen and Dressel (ref. 13), and Pinney (ref. 14). 
We shall present only the results here without proving any of the assertions. 
The references given should be consulted for details. Thus, for any functions 
w(x) and q{x) of x, any function Y(y) of y, and any constant a, the differential 
equation 


1 -r(y) 

Y(y) 


/ 2 . 


w'(x ) 
w(x) y 


t 


q(x) + aw 2 (x ) exp 


H &)} 


Y(y) 


(4-72) 


has a solution of the form 


y= 0 (g) 1 / 2 ) 


(4-73) 
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where 0 is the inverse of the function 6 defined by 

Q(y) ~ ex pfyjy) (4-74) 

(that is, 0[0(y)] — y) and the function co is given by 

aj = CiU 2 l -hC 2 u^ + C3UiU2 (4-75) 

where C i, C 2 , and C 3 are constants and u x and u 2 are two linearly independent 
solutions of the linear differential equation 

_«/[*),_ k) B = 0 (4-76) 

w(x) 

For example, consider the equation introduced by Painleve (ref. 11) 

y"+^y' 2 +f(x)y'=g(x)y (4-77) 


where 6 is a constant and / and g are any functions of x. This is a special case 
of equation (4-72) with (1 — Y')/Y= b/y, w' lw = —f, a = 0, and q= (l + 6)g(:*;). 
Hence, we can take F=y/(6 + 1) and equation (4-74) becomes 


0 = exp (6+1) jy- =exp (6+1) In y = y< 6+1) 


Therefore, 


0 ( 0 ) = 0 l/(6 + 1) 


In this case the differential equation (4-76) is 


u" + fu' — (l + 6)#u = 0 


(4-78) 


And the solution to equation (4-77) is given by 
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y = 0(ft) 1/2 ) = ((^ 1 / 2 ) l/(X + 6) = (CmZ + CzUl + CsUiUj'IM'+W 


(4-79) 

where «i and u 2 are two linearly independent solutions of equation (4-78). 
It can be verified by direct substitution and by applying equation (4-78) that 
equation (4-77) will be satisfied if, and only if, 


C 3 = 2VC\ Vc, 


Hence, equation (4-79) becomes 

y=(Vc[u l + Vcl u 2 ) 1/(1 + 6) 

But since VCi «i + VC^ u 2 is also a solution of equation (4-78), this is equiva- 
lent to taking y = n 1 / (1 + 6) where «is a general solution of equation (4-78). 

Next consider the equation (ref. 14) 

y" +p(x)y+Mjr 3 = 0 (4-80) 

where p(x ) is any function of x and M is a constant. This is a special case of 
equation (4-72) with w = constant = Co, Y=y, q(x) = — p{x), a = M/Cl. 
Hence, 0(y) = y and, therefore, its inverse is O(0) = 0. But this shows that 

y=0 (o> 1/2 ) =ft> 1/2 = (Ci« ? + C 2 u% 4- C 3 U 1 U 2 ) 1/2 (4-81) 

where u x and u 2 are linearly independent solutions of 

u"+p{x)u = 0 (4-82) 

Let J denote the Wronskian of the solutions U\ and u 2 ; that is, 

j — uiu ' 2 — u[u 2 (4-83) 

Upon differentiating equation (4-83) with respect to x and using the fact that 
and u 2 satisfy equation (4-82), we find that 42 


42 Or see section 5.9. 
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which shows that J= constant. Substituting the solution (4-81) into equation 
(4-80), using equation (4-82), and using the fact that its Wronskian is constant, 
we find that equation (4-81) is a solution of equation (4-80) if, and only if, 

q=2 @[+c,c,) 


4.3 MISCELLANEOUS METHODS 
4 . 3.1 Change of Variables 

Frequently, a good choice of new dependent and independent variables 
will convert an equation to a simpler and more easily analyzed form. Some pro- 
cedures which have proved helpful for this purpose are given in this section. 

4 . 3 . 1. 1 General transformation of linear equations. — The second-order 
linear equation (4-3) is transformed by introducing a new dependent variable 
v which is of the form 


v=f(x)y (4-84) 

and a new independent variable t which is of the form 

t=g(x) (4-85) 

into an equation which is also linear and of the second order. 

The change of variable (4—84) transforms the linear homogeneous equation 


y" +P(x)y' + q(x)y= 0 (4-36) 


into an equation 


v" +pi(x)v’ -\-qfx)v = f) (4-86) 

of the same type. It is clear that equation (4-86) can be transformed back into 
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the original equation (4-36) by a change of variable which is also of the form 
(4-84). Any two equations which can be transformed into one another by a 
change of variable of the type (4-84) are said to be equivalent. The new coeffi- 
cients P\{x) and q, (*) of equation (4-86) are related to the original coefficients 
p(x ) and q(x) by 


1 , 1 2 1 , 1 2 
q ~2 P ~4 P t = *~2 p '~4 p i 

Hence, we may say that the quantity 

^ = Q~\p' ~\p 2 (4-87) 

remains invariant under the transformation (4-84). It is therefore called the 
invariant of equation (4-36). We have seen that any two equivalent equations 
have the same invariant. It can also be shown that, conversely, any two equa- 
tions with the same invariant are equivalent. 

If the solution of one equation of the form (4-36) is known, it is possible 
to find the solution of all equations which have the same invariant. Thus, when 
a new equation is encountered, we can compare its invariant with those of 
equations whose solutions are known. If we can find one which is the same, 
we will have succeeded in solving the original equation. In particular, the ad- 
joint equation (see section 4.1.6) of equation (4-36) has the same invariant </ 
as equation (4-36) and is therefore equivalent to it. 

If the coefficient p(x) in equation (4-36) is identically equal to zero, we say 
that the equation is in normal* 3 form. In this case the invariant (4-87) is given by 

J —q 

In the general case the equation (4-36) with invariant (4-87) can always be 
transformed into the normal equation 


v"+Sv = 0 (4-88) 

by a change of variable of the form (4-84). Then equation (4-88) is said to be the 


43 Notice that in this case the meaning of term normal is different from that of section 1.4. The proper interpre- 
tation of this term should always be clear from the context. 
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normal form of equation (4-36). It is clear that any given equation has only one 
normal form and that all equivalent equations have the same normal form. 
The normal differential equation 

0+ ✓(*)?- 0 

can always be transformed into the normal equation 

fftl) 

by changing both the dependent and independent variables by transformations 
of the types (4—84) and (4—85), respectively, provided the nonlinear differential 
equation 


/ dt \ 4 f , v / dtV , ld?tdt^ 3/dHV 

can be solved for the new independent variable t. When this is the case, the 
function / in equation (4-84) which determines the new dependent variable is 

/(*) = 

A fuller discussion of this topic as well as proofs of the various assertions 
made in this section can be found in Rainville (ref. 15, chapter 1). 

4 . 3 . 1.2 Transformation to an equation with constant coefficients. — 'We 
have seen that the homogeneous Euler equation (section 4. 1.3.2) can be trans- 
formed by a change of independent variable into an equation with constant 
coefficients (which we know how to solve). More generally, the linear homo- 
geneous equation 



/' +P(x)/ + q(x)y= 0 (4-36) 

can be transformed into a linear equation with constant coefficients by a change 
of independent variable if, and only if, 
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fa* 2 ™ 

— — = constant (4-89) 

When condition (4-89) holds, the change in variable (4-85) is given by 


t = c I y/q(x)dx (4-90) 


where c is any constant. 

In order to prove this, it is only necessary to substitute the change in vari- 
able (4-85) into equation (4-36) and obtain the transformed equation 

cPt dt 

cPy dx 2 P dx dy . g 
dt 2 (dtV dt (dty y 

\dxj 

It is now easy to see that the coefficient of y in this equation will be constant 
if, and only if, t is given by equation (4^90). Further, substituting (4^90) into 
the coefficient of dyldt, we find that this coefficient will also be constant if, 
and only if, condition (4-89) is satisfied. 

Thus, for example, for the homogeneous Euler equation, p(x)=polx 
and g{x ) = qolx 2 . Hence, equation (4-89) is satisfied, and equation (4-90) for 
the new independent variable becomes in this case 

t = c j ~~ dx — cq l J 2 In x 

The change of variable (4-30) given in section 4. 1.3.2 is a special case of this. 

4 . 3 . 1.3 Interchanging of dependent and independent variables. — Differ- 
entiating the identity 

dy _ /&\-> 
dx \dyj 

with respect to x shows that 
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d 2 y _ cPx / dx \ ~ 3 
dx 2 dy 2 \dy) 


And when these relations are substituted into the general differential equation 
(4-1), we obtain the equation 



in which x is the dependent variable and y is the independent variable. Some- 
times this change of variable will result in an equation whose solution is 
known or can be found. 

4 . 3 . 1.4 Legendre transform. — The Legendre transform which consists 
of introducing the new independent and dependent variables p and q, re- 
spectively, defined by 


_ (fy 

dx 


P = 


(4-91) 


dy 

X dx~ y 


(4-92) 


can be used to radically alter the form of a differential equation. It follows 
from these relations, after differentiating equation (4-92), that 


dq = xdp + pdx — dy 


dy = pdx 


And when these equations are solved for x and y, we find that the inverse 
transformation is given by 


x = 


dq 

dp 


y = 



- Q 


But it follows from equations (4-91) and (4-93) that 


(4-93) 

(4-94) 
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d 2 y _ 


= /d!s) 

dx 2 

Up/ 

UpV 


(4-95) 


Hence, substituting equation (4-91) and equations (4-92) to (4-95) into 
equation (4-1) yields the transformed differential equation 


F 



= 0 


which is also a second-order equation. If a solution q = f(p) of this equation can 
be found, it can be substituted into equations (4-93) and (4-94) to obtain the 
parametric equations 


y—pf'ip) ~ f(P ) 

x=f'(p) 


of a solution y of equation (4-1). 

4.3.2 Equation Splitting 

When equations which are split in some natural way into sums, quotients, 
or products of terms, such as 

/(y"> y\ y , x)+g(y", y\ *) = o 

or 


/(/,y'»y>*) 

g(y",y\y,x) 

are encountered, it is sometimes possible to obtain a solution by putting 
f(y",y'iy,x) = h(x) =-#(/, y\y,* ) 
for equations of the first type and 
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f{y",y',y,x) = ch(x\ 
g(y",y\y,x) = h(x) 


for equations of the second type. If h{x) can be chosen so that a common 
solution to the pair of equations can be found, this solution will also be a 
solution of the original equation. 

4.3.3 Tables of Differential Equations and Solutions 

Two valuable catalogs of solutions to differential equations can be found in 
the volumes by Murphy (ref. 16) and Kamke (ref. 17). Murphy lists over 2000 
solved equations which are classified according to order and degree; Kamke 
gives about 1500 equations along with their solutions and references to the 
literature. 
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CHAPTER 5 


Review of Complex Variables 

A general procedure for solving second-order linear equations will be 
given in chapter 6. But this procedure involves the use of power series, and 
the discussion of power series becomes much simpler when it is carried out 
in its natural setting— the complex plane. In order to take advantage of this 
fact we shall extend the definition of a differential equation given in chapter 1 
to include the case where the variables which occur in the equation are complex. 
Thus, by considering a more general situation we are actually able to simplify 
the treatment. Another reason for making this extension is that it allows us to 
see how solutions are connected across the singular points of the equation. 

In this chapter, those concepts from the theory of functions of a complex 
variable which are needed for this purpose will be reviewed. The treatment 
is essentially descriptive; and rigorous proofs of the various assertions are, 
for the most part, omitted. For a more detailed treatment of the topics covered 
herein (including the omitted proofs), as well as a more complete coverage of 
the vast field of complex variables, the reader is referred to the many excellent 
texts 44 which are devoted entirely to this subject. 

5.1 COMPLEX VARIABLES 

Let x and y be two independent real variables and, as is the usual practice, 
put i= V^l. Then z = x + iy is a complex variable. It is frequently convenient 
to think of the values of 2 as points in a plane, called the complex plane , whose 
Cartesian coordinates are x and y. 

44 A good elementary treatment is given by Churchill (ref. 18). A more advanced and theoretical treatment is 
given by Ahlfors (ref. 19), while the text by Carrier, Krook, and Pearson (ref. 20) emphasizes advanced applications. 
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Figure 5-1. — Polar representation of complex number z. 


Instead of using the Cartesian coordinates x and y 9 we can also use the 
polar coordinates r— VV+y 2 and 0=tan-! y/x to locate points in this plane. 
(The relation between the polar and Cartesian coordinates is illustrated in 
fig. 5—1.) Then 


z — x + iy— r cos 6 + ir sin d 


And upon using Euler’s formula, we obtain the polar representation 


z — r (cos 6+ i sin 6) = re i0 


of the complex number 2 . Notice that for n= 0, ±1 , ± 2, . . . 


e i 2 mr= cos 2mr+ i sin 2mr— 1 


Hence, 


8 tt zzz j'g i{ 8~)~ 2/i7r) 

The definitions of a domain and a neighborhood have been given in 
section 1.1 for a general n-dimensional space. We shall continue to use these 
definitions in the two-dimensional complex plane. 
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The complex conjugate of the variable z is denoted by z* and defined by 
z* = x — iy. 

The absolute value or modulus of the complex variable z is defined to be 
the length of the vector joining the origin with the point z in the complex 
plane and is denoted by |z|. Therefore , 45 


z | = Vx 2 + y 2 = Vzz* 


5.2 ANALYTIC FUNCTIONS OF COMPLEX VARIABLE 

Let u(x, y) and v(x,y) be any two real-valued functions 46 (of the variables 
x and y) which are defined in some region of the complex plane. Then w = u+ iv 
is a complex-valued function of x and y. Since w associates a complex number 
with each point z = x + iy of some region of the complex plane, we say that 
w is a function of the complex variable z. 

We shall consider only a particular class of functions of a complex variable 
called analytic or holomorphic functions. In order to define this class of func- 
tions we first introduce the concept of a complex derivative. To this end, let 
w = u + iv be a function of the complex variable z and suppose that x and y are 
changed by the amounts Ax and Ay, respectively. Then ui changes by an amount 
A w= A u+ i At>. Now, by analogy with the definitions of the derivative of a 
real-valued function of a real variable, we define dw/dz, the derivative 47 of w 
with respect to z at tfie point z, to be the limit 


dw 

dz 


= lim 

A*-»0 

Ar-0 


Au-f ikv _y Am; 
Ax + iAy a^o A z 

y Ay *0 


provided that this limit not only exists but that it is independent of the manner 
in which A* and Ay approach zero. When Ax and Ay approach zero in some 
prescribed manner, Az approaches zero along some path in the complex plane, 


45 Notice that |z| is equal to the polar coordinate r. 

46 Recall that according to the convention adopted in section 1.1 we assume that all functions are single valued 
unless explicitly stated otherwise. 

47 We shall frequently write w' (z) or w in place of dwldz and id n) {z) or id n) in place of d n w/dz n for n = 1, 2, . . . 
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Figure 5-2. —Typical path along which A z can approach zero. 


as indicated schematically in figure 5-2. But the definition implies that, if 
vo is to have a derivative at the point z, then Aw/Az must approach the same 
limit for every such path along which Az approaches zero. Although this 
requirement imposed on the complex derivative may seem unimportant, its 
implications are enormous. In fact, by allowing Az to approach zero along 
various paths, it can be readily shown (ref. 18, p. 34) that, if u and v are con- 
tinuously differentiable, 48 a necessary and sufficient condition for the ex- 
istence of the derivative dw/dz at the point zo is that u and v satisfy the two 
Cauchy-Riemann equations 

du dv , du dv 

- — = — - and — = — — 

dx dy dy dx 

at this point. This shows that much greater restrictions are imposed on those 
complex functions which possess complex derivatives in the sense of the 
definition given above than are imposed on the real functions which possess 
ordinary real derivatives. However, since the complex derivative is formally 
the same as the real derivative, the usual rules for differentiating sums, prod- 
ucts, quotients, etc., still apply (ref. 18, p. 31). 

A function w{z ) of the complex variable z is said to be analytic 49 or holomor- 
phic in a domain D if it possesses a derivative at every point of D. Frequently, 


48 This means that the partial derivatives d ufdx, d ujdy, d vfdx, and dv/dy exist and are continuous. 
4 * 1 he terms regular and monogenic are also used. 
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when it is of no consequence in the discussion, the reference to the domain D is 
omitted and we simply say that the function w(z) is analytic. A function is 
said to be analytic at a point zo if it is analytic in some neighborhood of this 
point. 

Notice that the identity function 

w = z — x + iy 

is analytic at every point since the Cauchy-Riemann equations (with u = x and 
v = y) are always satisfied. However, the complex conjugate of this function 

w = z* = x — iy 

is not analytic at any point since in this case du/dx — 1 and dv/dy — — 1 and 
therefore the Cauchy-Riemann equations are never satisfied. 

The real and imaginary parts of the function 

— 2 — 1 _ z*_ _ xj^iy 

W z x+iy zz * x 2 + y 2 


are 


u = 


x 

x 2 + y 2 


and 


v = 


- y 

x 2 + y 2 


respectively. And upon taking the partial derivatives of these functions we see 
that the Cauchy-Riemann equations are satisfied at every point except z= 0, 
where the partial derivatives fail to exist. Hence, w = 1/z is analytic at every 
point except z = 0. 

It is a remarkable fact (see ref. 18, p. 122) that any function w{z ) which is 
analytic at a point zo possesses derivatives of all orders at this point. And these 
derivatives are themselves analytic functions at this point. It is easy to see that 
the sum and product of any two analytic functions are analytic within any 
domain in which both functions are analytic. Usually, any function which is 
obtained from a real algebraic, elementary-transcendental function (trigono- 
metric, exponential, logarithmic, etc.) or a common higher-transcendental func- 
tion (Bessel function, hypergeometric function, etc.) by replacing the real 
variable x by the complex variable z=x + iy is an analytic function within some 
domain. 
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5.3 CONFORMAL MAPPING 50 

According to the definition given in section 5.2, a complex function 
w=u-\-iv of the complex variable z = x+iy associates a pair of numbers 
(u,v) with each point ( x,y ) of some region of the complex z-plane. We can 
interpret these numbers as coordinates of a point in a complex ic-plane. Thus, 
we may think of the function w(z) as a transformation or a mapping of some 
region in the z-plane into some region in the tc-plane. More specifically, we 
can think of an analytic function w(z ) which is defined on a domain D in the 
z-plane as a mapping of D onto a region R in the ic-plane, as shown schematically 
in figure 5-3. In order to determine the properties of an analytic function w(z) 
it is frequently helpful to study the manner in which this function transforms 
various points, curves, or domains in the z-plane into corresponding points, 
curves, or regions in the ic-plane. 



Figure 5-3.— Mapping of D onto R by w(z). 


Thus, the transformation w = \ jz associates a single point in the tc-plane 
with each point in the z-plane except the origin z = 0. It maps all those points 
lying outside the circle with radius R and center at the origin in the z-plane 
into the interior of the circle with radius 1/R and center at the origin in the 
ie-plane. Although there is no point of the z-plane which maps into the point 
ic = 0, we can, by choosing R sufficiently large, make the points outside this 


50 A comprehensive treatment of this subject can be found in Nehari (ref. 21). 
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circle map into a circle of arbitrary small radius about w= 0. However, it is 
frequently convenient to “complete” the 2 -plane by adding a fictitious point 
2 =oo which maps into the point w= 0 under the mapping w = 1/z. Thus, when 
we speak of the behavior of an equation or a function at the point z = ° °, we 
actually mean the behavior at the point w = 0 of the transformed equation or 
function obtained by putting z=\/w. When it is necessary to distinguish 
between the point 2 = 00 and the other points of the complex plane, the latter 
are said to be finite points. When the complex plane includes the point 2 = 
it is called the extended plane ; and when 2 = °° is excluded, it is called the 
finite plane. 

Let w(z ) be analytic in a domain D and let 2 0 be any point of D at which 
dw/dz 7^ 0. Then w(z) transforms any two smooth curves passing through zo 
in such a way that their image curves intersect at the point wo= w(zo) with 
the same angle (in both magnitude and sense of rotation) as the original curves 
in the z-plane (ref. 18, p. 174). Thus, the mapping “preserves angles,” and we 
say that w(z) is a conformal mapping at all points where dw/dz ¥= 0. 

A mapping which is particularly useful for the treatment of certain types 
of linear differential equations is the linear fractional transform 


w 


az + b 
cz+ d 


(5-1) 


Notice that, if ad — bc = 0, this transformation reduces to w= constant. In 
any other case, it transforms each point in the extended z-plane into a point 
in the extended w-plane in such a way that no two points in the z-plane map into 
the same point in the ic-plane. In addition, there is a point in the z-plane which 
maps into each point in the w-plane. For this reason, we say that a linear frac- 
tional transformation with ad— be 0 is nonsingular. 

If we consider straight lines and points as being degenerate circles (i.e., 
circles having infinite or zero radii), we can say that the linear fractional 
transformation always maps circles into circles. 

Performing two nonsingular linear fractional transformations in suc- 
cession is equivalent to performing a single nonsingular linear fractional 
transformation. In fact, any nonsingular linear fractional transformation can 
be performed by carrying out not more than four successive transformations 
each of which has one of the three elementary forms 

w=az (rotation and stretching) (5-2) 
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w = (3 + z (translation) (5-3) 

w = — (inversion) (5-4) 

z 

In order to prove this, first suppose that c ¥=■ 0. Then equation (5-1) can 
be written as 



(5-5) 


Now transform the z-plane into the £i-plane by the transformation 



Then transform the ti-plane into the t 2 -plane by 

1 



and the £ 2 -plane into the £ 3 -plane by 



Finally, transform the £ 3 -plane into the w-plane by 


a , 

W — — r £ 3 
c 

Upon combining these successive transformations we obtain equation (5-5) 
and therefore equation (5-1). This proves the assertion for the case where 
c # 0. When c = 0, equation (5-1) reduces to 
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b 

d 


which is easily seen to be equivalent to the succession of transforms 


a , . b 

ti = -,z and w=ti+~, 
d a 


If the transformation (5-1) is to be nonsingular, at least two of the con- 
stants a, b, c, and d must be nonzero. By dividing through by one of these, 
it is easy to see that equation (5-1) in fact contains only three arbitrary con- 
stants. It is therefore not surprising that these constants can always be chosen 
so that the linear fractional transformation maps any three given points in the 
z-plane, say Zi, Z2, and Z 3 , into any three given points, say w\, w%, and M>3,in 
the loplane. Thus, for example, the linear fractional transformation which 
takes z = zi into u; = 0, z = Z2 into w= 1, andz = Z 3 into m; = 00 is 


w 


Z 2 — Z 3 Z — Zi 
Z2—Z1 Z — Z 3 


5.4 ISOLATED SINGULAR POINTS OF ANALYTIC FUNCTIONS 

If a function w(z) is analytic at every point in some neighborhood of a 
point zo except at the point zo itself, 51 then zo is called an isolated singular point 
or an isolated singularity of the function w (z) . Thus, the function 

= H7TW <5 ' 6) 

has isolated singularities at the points z = 0 and z = —1. 

An isolated singular point zo of the function w(z) is called a removable 

singularity if the limit of w(z) as z -* z 0 is equal to some finite number. Let k be 
a positive integer. An isolated singular point zo of the function w{z) is said to be 
a pole of order k if the limit as z^zo of the quantity (z-zo)Mz) is equal to 
some finite nonzero number. A pole of order one is called a simple pole. 


51 The function need not even be defined at the point Zo. 
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Thus, for example, the function given by equation (5-6) has a simple pole 
at z = 0 and has a pole of order two at z — — 1. 

Because of the way analytic functions arise in practice, it turns out that we 
sometimes arrive at a function w(z) which is not defined at a point z o but is 
defined and analytic at every point of a neighborhood of Zo. The point z o is 
thus an isolated singular point of w(z). But if this point is also a removable 
singularity, the function can be made analytic at z o simply by assigning a suit- 
able value to w{z) at this point (ref. 18, p. 158). For example, since division by 
zero is undefined, the function w(z) given in equation (5-6) is undefined at 
z = 0. Thus,0 X m;( 0) is also undefined. Hence, the function £(z) defined by 

£(z) = zw(z ) 

is not defined at the point z = 0. However, upon defining £(0) by 


£(0) = lim £(z) = 2 

z~* 0 


we obtain a function which is analytic at z = 0. 

It is easy to see that, if w(z) has a pole of order k at zo, the function 


£(z) s (z — Zo)*M>(z) 


has a removable singularity at z o and is therefore “essentially” analytic at this 
point. 52 We shall sometimes say that an analytic function has a pole of order 
zero at the point zo if zo is a removable singularity of this function. 

Any isolated singular point of an analytic function which is not a pole or 
a removable singularity is called an essential singularity. For example, the 
function sin(l/z) has an essential singularity at z = 0. 

There are some important differences between poles and essential singu- 
larities. For example, if w(z) has a pole at zo, the function 1/m;(z) is analytic 53 


52 For the purposes of this book we can assume that any analytic function which is encountered has already 
been defined at its removable singularities in such a way that it is analytic at these points. 

53 In fact, it is equal to zero at z 0 (see preceding footnote). 
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at zo; but if w(z) has an essential singularity at zo, so does 1/m;(z) (ref. 22, p. 
110). A pole, then, is a point where a function w(z) is not analytic only because 
its modulus \w(z ) | becomes infinitely large at this point and for no other reason. 

A function which is analytic at every finite point of the complex plane is 
called an entire function. And Liouville’s theorem (ref. 18, p. 125) states that 
any entire function which is also analytic at infinity (see section 5.3) must, in 
fact, be equal to a constant. 

A polynomial is a function of the form ao+fliz + . . . + anZ n where 
ao, . . . , a n are complex constants. It is an entire function, and it has a pole 
of order n at infinity. A rational function is the ratio of two polynomials (which 
may be chosen to have no linear factors in common). It is therefore a function 
of the form 


w(z) 


ao+ aiz+ a2Z iJ r . . . + anZ n 
6o + b\z-\- '& 2 Z 2 + • • • “I ~bmZ m 


a„ # 0; b m ¥=■ 0 


(5-7) 


and is analytic everywhere in the finite plane except at those points where its 
denominator is equal to zero. These points are poles of w{z). If m 5= n, then 
w(z) is analytic at the point z =0 °; otherwise it has a pole of order n — m at 
this point. 

A function which is analytic at every point of a domain D except at those 
points of D where it has poles is said to be meromorphic in D. For example, 
the rational function (5^7) is meromorphic in the entire finite plane. In fact, 
any function which is meromorphic in the entire finite plane and has a pole at 
infinity is necessarily a rational function (ref. 20, p. 60). A function which is 
meromorphic in the entire finite plane can have at most a finite number of 
poles in any domain D of finite size. However, it may have infinitely many 
poles if D is infinitely large. 54 

In view of Euler’s formula (see section 5.1) it is natural to define the func- 
tion e z by the formula 


e z = e x+iy = e x cos y _|_ (gX g j n y 

Then e z is an entire function and has an essential singularity at z = °°. We can 
now define the functions sin z and cos z by the formulas 


54 For example, D could be the entire plane or the upper half plane. 
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giz g-iz giz-f- g-iz 

sin z = ^ and cos z = 

which extend in a natural way the definitions given to these functions when 
the variables are real. They are also entire functions with essential singularities 
at z =co. On the other hand, the function 1/sin z is meromorphic in the entire 
finite plane; and its poles are located at the points z = mr for n — 0, ±1, — 2, 
etc. It also has an essential singularity at z=°°. 

5.5 POWER SERIES 

A power series about a point zo is an infinite series of the form 

^ a n (z — z 0 ) n (5-8) 

n= o 

in which the coefficients a„ can be any complex numbers. This series certainly 
converges at the point zo, which may be the only point at which it actually 
does converge. Or the opposite extreme could occur and the series might 
converge at every point of the finite plane. In all other cases the series will 
converge at every point within a circle of radius R and center at zo, called the 
circle of convergence of the series, and will diverge at every point which lies 
outside this circle. Thus (ref. 19), there exists a number R lying in the range 
0 oo called the radius of convergence of the series such that the series 

(5-8) converges at all points z which satisfy the inequality 55 |z — z 0 | < R and 
diverges at all points z which satisfy the inequality 56 |z — zo | > R • The question 
of whether the series is convergent for the points which satisfy the equality 
\z — z 0 | — R (i.e., points on the circle of convergence) is more subtle but un- 
important for our purposes. 

A power series with a nonzero radius of convergence R converges to an 
analytic function and can be differentiated term by term (i.e., the order of 
summation and differentiation can be interchanged) at every point within its 
circle of convergence. Thus, there exists a function w(z) which is analytic at 
every point within the circle |z — zo | < R such that 

55 The series is, in fact, absolutely convergent at these points. This means that the series still converges when 
each of its terms is replaced by its absolute value. See ref. 19 for more details. 

56 This result was first established by Abel. 
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w(z) = jr a„(z — zo) n 

n= 0 

= V na n (z — zo) n ~ 1 i 

aZ n= 1 


for |z — zo| < f? 


It can also be shown (ref. 19) that, when the series (5—8) has a nonzero 
radius of convergence R, there exists a positive constant M ¥= 00 such that for 
any number 0 < r < R the coefficients of the series satisfy the Cauchy estimates 



for n = 0, 1, 2, . . . 


This result will be used in the discussion of the solutions of differential equa- 
tions in the next chapter. 

Now suppose that w(z) is analytic in the domain D and that zo is any 
point of D. Then the power series 

V w (n) (zo) (z— z 0 ) n (5-9) 

„ Tt • 
n = 0 


( where u/ 0) (zo) = w{z ) ) converges to w(z) at every point z within the largest 
circle centered at zo lying entirely within D (ref. 18, p. 129). This series is called 
a Taylor series expansion of w(z) about zo. Its radius of convergence is at least 
equal to the shortest distance between zo and the boundary of D. It may be 
larger than this but we have no guarantee that the series will converge to w(z) 
at points which lie outside of D. The series representation (5-9) is unique in 

oo 

the sense that if ^ b n (z — zo) n is any power series which converges to w(z ) 

n - 0 

within any circle about zo, then necessarily (ref. 20, p. 49) 

b n = —.id n) (zo) for n = 0, 1,2, . . . 

n\ 


Next, suppose that w(z) is analytic at every point of a domain D except 
for a certain number of isolated singular points and let zo be a point of D at 
which w(z) is analytic. Then the circle of convergence of the Taylor series 
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(a) Circle of convergence does not intersect boundary of D. 

(b) Maximum circle of convergence intersects boundary of D. 

Figure 5-4. — Circle of convergence of a Taylor series. 


of w(z ) about zo passes through the nearest isolated singular point of w{z) 
if it does not intersect the boundary of D. In this latter case we can only assert 
that the radius of convergence does not exceed the distance between zo and the 
nearest of the isolated singularities of w(z) within D. These results are illus- 
trated in figure 5^1. Thus, in particular, if the function w(z) is analytic at every 
finite point of the complex plane except at a certain number of isolated singu- 
lar points, then the circle of convergence of its Taylor series expansion about 
any point zo where w(z) is analytic always passes through the nearest isolated 
singular point to zo- 

For example, we have seen in section 5.2 that w(z) = e z is an entire 
function. It is easy to see that the nth derivative of this function is 
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w (n \z) = e z for n = 0,1,2,. . . 
Hence, the Taylor series expansion of e z about 2 = 0 is 


e z 



z n 


(5-10) 


And this series converges in the entire finite plane. 

We have also shown in section 5.2 that the function w(z) = l/z is analytic 
at every point of the complex plane except the origin 2 = 0. The nth derivative 
of this function is 


, . , , (— l)"n! 

"“W = — J» 


Hence, its Taylor series expansion about the point 2 = 1 is 

1 


= 2 (-l)"(*-l) n 


n=0 


It is easy to verify that the radius of convergence of this series is equal to 1 
and that the circle of convergence passes through the isolated singular point 
2 = 0. Upon replacing 1 —2 by 2 in this series we obtain the geometric series 


7-^ = 2 2 " 

1-2 A 


(5-11) 


rc -0 


which converges within the unit circle \z\ = 1. This circle passes through the 
isolated singular point 2 = 1 of the function (1 — 2 ) -1 . 

We shall frequently find it necessary to add and multiply two power 
series. The sum of two power series about the same point can be obtained 
by adding the two series term by term. The resulting series will converge 
within the smaller of the two circles of convergence of the original series 

(ref. 23, p. 123). Now let V c« and Y d n be any two absolutely convergent 
series. Any expression for the product c„ ) ( 2 , 4 ) must certainly include 
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all terms of the type Cidj. But all terms of this type must belong to the array 


Codo + Codi + cock + • • • + cod n + ■ ■ • 


+ cido + C\di + d\ d% + . . . + Cid n + 


+ C2 do + C2 d\ + C2 dz + . • . + C2 dn + • • • 


“I” Cmdo 1 Cmdi Cmdi * » • "f” Cmdn “1“ « • • 

+ . . . 

And the series which is formed by grouping together the terms along the di- 
agonals of this array and summing the result over all diagonals is called the 
Cauchy product of the two series. Thus, the general term in the Cauchy 
product is 

n n 

a n — C n do+ C n -ldi + Cn_ 2 cfc+ • • . + Cod„= V C n -kdk= y Ckdn-k 

m “0 


00 

and the Cauchy product is the series y a n . Cauchy’s theorem (ref. 18, p. 147) 

n =0 

00 

states that the Cauchy product y a„ is absolutely convergent and that 

n= 0 


c -) (s dn ) = (,?„ a ") = % ctd 


The Cauchy product is particularly convenient to use when multiplying 
power series. For, in this case, we get 

[y a n (z — zo) n l [y b n (z — zo) n — y y ak{z — zo) k b n -k(z — zo) n ~ k 
1^0 J L^O J «= 0 rn b 

= y (z - z 0 ) n ^ a k b n -kj 
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which converges within the smaller circle of convergence of the two original 
series. 

For example, let us use these ideas to find the power series expansion of 
the function l/[(a— z) (b— z)], where a and b are arbitrary complex constants. 
To this end notice that the geometric expansion (5-11) implies that 


1 



a 


oo 


-2 

n — (\ 


a~ n z n 


and 


— = ^ b~ n z n 

£. n=0 

b 


Hence, upon forming the Cauchy product we obtain 

! = _L f 2 „ f ('IV ('IV-*' 

( a — z)(b — z ) jB i Si,\ a ) W 


But since 


/ l\ n+1 _ /l\ n+1 

A I 1 W 1 \ n ~ k _ \a/ \6/ 

£ 0 \a)\b) 1_1 


a b 


we find that 


1 


(a-z){b-z) 


= V [a-( n +D — 6 -<”+*> lz" 


Next, in order to find the Taylor series expansion of the function cos z/(l + z 2 ) 
about the point z=0, notice that by changing variables in the geometric expan- 
sion (5-11), we obtain the expansion 


1 

1 + z 2 


2 (-1 ) n 2 2 ” 

n = 0 


and that by using the expansion (5—9) and the definition of cos z given in section 
5.4, we obtain the expansion 
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cos z = ^ (— l) n 
n=0 


,2n 


(2 »)! 


Then, upon taking the Cauchy product of these two series, we obtain 


cosz 

Y+? 


= 2 (-i)"z 2n 2 

n-0 fc=0 


l 

( 2 *)! 


We have seen that a power series (with positive exponents) represents 
(converges to) an analytic function within a circle. Similarly, the series (ref. 
18, p. 134) 


^ a„(z — z 0 ) n 

n = ~oc 


(5-12) 


containing both positive and negative exponents, converges to an analytic 
function in an annular region 57 lying between two concentric circles centered 
at zo and of radii R i and R 2 with Ri < R 2 (i.e. , at all points z for which 
Ri < \z~ zo| < R 2 ). Conversely, any function w{z) which is analytic in an 
annular region Ri < \z — z 0 | < R 2 can always be represented by a series of 
the form (5—12) at every point of this region. This expansion is called a Laurent 
series. An important special case occurs when the function w(z) has an isolated 
singularity at the point z 0 and is analytic at every other point within the circle 
| z — zo| = R. In this case the series 


w(z)= ^ a„{z — zo)" 

n~-oo 

converges to w(z) at every point z of the punctured circular region 
0< \z— zo|</? bounded by the circle \z— Zo| = /? and the point zo. Then the 
isolated singularity zo is a pole of order k of the function w(z) if, and only if, 
a_ k 7 ^ 0 and a„ = 0 for all n < — k. Hence, if zo is an essential singularity, 
infinitely many negative powers of z — Zo will occur in the series. 

For example, we can use the geometric series (5-11) to obtain the expansion 


57 Provided it converges at all. 
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1 _1 1 
z 1 Z 1 


00 00 
^ z -n-l == £ z -« = 

n=0 


-1 

I 


n=-oo 


Z n 


of the function (z — l) -1 about the point z = °°. This series converges at all 
points which lie outside the circle |z| = 1. And since the series 


3 

3 — z 


3 - "z" 


converges within the circle |z| =3, we conclude that the series 


2z 

(z — 1) (3 — z) 




2 


n =— oo 


anZ" 


with 


for n = — 1, — 2 , . . . 
for n = 0, 1, 2, . . . 

converges in the annular region 1 < |z| < 3. 

5.6 COMPLEX INTEGRATION 

Let w{z) be analytic in a domain D. Then the integral J w(z)dz of iv(z) 

along a curve or path T which lies entirely within D is defined in terms of two 
real line integrals along T by 



^ w(z)dz = £ (udx — vdy) + i £ ( udy+vdx ) (5—13) 


It is easy to verify from this definition that the usual rules of integration 
still apply. Thus, in particular, if a and /3 are complex constants and Wi(z) 
and Wz{z) are analytic functions 
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J [awi{z) + fiW 2 {z)'\dz= otj^Wi{z)dz+ fi J_w 2 (z)dz 

and if the direction of integration along T is reversed, the integral is multiplied 
by — 1. 

For example, the integral of the function w(z) =z along the line y 0 = con- 
stant, from the point (0, yo) to the point (*o, To), is 

Jzdz = J xdx+ij yodx=^+ iy 0 X 0 = 2 (xo + iyo) 2 — 2 (iyo ) 2 

In order to integrate w(z)=z along a circular path centered at 2=0, it is best 
to use polar notation. Thus, let the radius of the circle be R. Then on this circle 
z=Re i0 and dz= iRe t6 dd. Hence, 




R 2 e 2ie d6—0 


More generally, it can be shown (ref. 18, p. Ill) that, if C is any closed curve 
within D and if w{z) is analytic at every point in the interior of C, then 


J w(z)dz = 0 

Therefore, if I\ and T 2 are any two curves in D which begin at the point z\ and 
end at the point z 2 (fig. 5-5), then 



provided w{z) is analytic within the domain enclosed by these curves. 

Because of this fact the exact path along which the integration is carried 
out is frequently unimportant; and when this is the case, we write 

I w(z)dz in place of I w(z)dz. 

J*i -' r 

The function m;(z) = 1/z has a singular point at the origin; and its integral 
along the circle of radius R centered at the origin is 
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(a) Line integrals along Ti (b) Line integrals equal along 

and r 2 not necessarily Tj and T 2 . 

equal. 

FIGURE 5-5. — Paths for line integral. (Arrows indicate direction of integration.) 


I — dz=i [ dd= 27 ri 

r=R Z JO 


If the function w(z) is an analytic function of z in D, the function F(z ) de- 
fined by 

F(z)= J w(z)dz 

J Z 0 

is also an analytic function oizmD and (ref. 18, p. 114) 


Hence, 


dFjz) 

dz 


= w{z) 


F{z) — 



F'{z)dz 


This shows that just as in the case of real variables, integration and differen- 
tiation are inverse processes. For example, in order to evaluate the integral 

f z n dz notice that, for rn^—l, 


DIFFERENTIAL EQUATIONS 


Hence, 


„ 1 dz n+1 

z n— 

n + 1 dz 



z n dz = 


1 

n+ 1 


I 


2 dz W+1 

dz 

0 


dz = 


Z»+> 

n + 1 


n + 1 


It can also be shown (ref. 18, p. 141) that any convergent power series can 
be integrated term by term and the resulting series will have the same circle 
of convergence as the original series. In fact, it can be shown that the product 
of any convergent power series about a point zo with a function of the form 
(z — zo) x , where X is a complex constant, can also be integrated term by 
term. 


5.7 ANALYTIC CONTINUATION 


5 . 7.1 Definition 

First, suppose that u;i(z) and Wi (z) are both analytic in some common 
domain D. It can be shown (ref. 18, p. 259) that if w 2 (z) at all points of 

some subdomain of Dor even at all points of some curve which lies entirely within 
D, then wi(z) and w 2 (z) are equal at every point of D. This assertion is known 
as the fundamental theorem of analytic continuation. It means that there is 
only one analytic function in a domain D which takes on any given set of values 
which are prescribed at every point of a subdomain of D or even at every point 
of some curve in D. For example, if the function w(z) is analytic in a domain D 
and is equal to zero at every point of a subdomain of D or at every point of a 
curve lying within D, then w(z) is zero at every point of D. 

We have seen in section 5.5 that the analytic function 


1 

1 — z 


(5-14) 


which is defined and analytic at every point of the complex plane except 
z = 1 can be expanded in the Taylor series 


oo 


2 


z” 


(5-15) 


132 


REVIEW OF COMPLEX VARIABLES 



Figure 5-6. — Domains for direct analytic contination; D c is the common part of both domains 

D\ and Z) 2 . 


which converges only within the unit circle. Thus, the function (5-15) is de- 
fined only within the unit circle, where it is equal to the function (5-14), 
which, however, is defined in a much larger region. The function (5-14) can 
therefore be considered as an extension of the function (5-15) from the unit 
circle to the entire complex plane with the point z — 1 excluded. Indeed, 
whenever an analytic function is defined by some expression (such as a power 
series) in some domain D which is not the whole complex plane, it is natural 
to ask if this function can be extended to a larger domain. 

First, consider the function w x (z) defined on the domain D , and let D 2 
be another domain, part of which coincides with D\ as shown in figure 5—6. 
It can be shown that the common region D c , which is part of both domains, 
is itself a domain 58 and hence is a subdomain of both the domain D x and the 
domain Z) 2 . Now suppose that there exists a function w 2 which is analytic 
in the domain D 2 and which is equal to Wi (z) at every point z of the common 
domain D c . Of course, the function w 2 (z) may not exist. However, if it does 
exist, it is called the direct analytic continuation of the function wi to the 
domain D 2 . 

There can be at most one direct analytic continuation of a function to any 
given domain. For if £ 2 is another direct analytic continuation of Wi to D 2 , then 
w 2 and £ 2 are both analytic in D 2 and are equal to one another in the subdomain 
D c . Hence, the fundamental theorem shows that w 2 and £ 2 are equal at every 
point of D 2 . And this of course means that £ 2 and w 2 are the same function. 


58 See, e.g., ref. 8, ch. l.The domains D 1 and D 2 are said to intersect; and the common domain D c , called the 
intersection of D x and Z) 2 , is denoted by D x Pi Z) 2 . 
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Let D be the domain which consists of all points which belong either to 
the domain Di or the domain D 2 , or both. 59 Then since the analytic functions 
Wi{z) and w 2 (z ) are equal at all points where they are both defined, we can 
define a new analytic function w(z ) on the domain D by the relation 


w(z) — 


[ Wi(z) 
w 2 (z) 


for z in Di 
for z in D 2 


It is clear that the analytic function w(z) is an extension of the analytic func- 
tion W\ (z) from the domain D\ to the larger domain D. 

The process described above does not have to terminate with the function 
w 2 (z). It may, for example, be possible to find a direct analytic continuation 
w 3 (z) of the function w 2 (z) to a domain D 3 , and so on. Proceeding in this 
manner we obtain a chain of domains D 2 , D 3 , . . . such as that shown in figure 
5-7 and a collection of analytic functions w 2 (z), w 3 (z), . . . defined on these 
domains. Each of these functions is said to be an analytic continuation of 
the function Wi(z), and the procedure itself is called analytic continuation. 
We say that W\{z) is analytically continued along a simple 60 curve T which ex- 
tends from D\ to some point P if T is completely covered by a chain of domains 
D 2 , D 3 , . . . (as shown in fig. 5-8) along which ici(z) can be analytically con- 
tinued in the manner described above. 


59 It is shown in various books on analysis that this extended region is indeed a domain. It is called the union of 
the domains D x and D 2 and is denoted by D x U D 2 , e.g., see ref. 8, ch. 1. 

60 Roughly, this means that T is smooth and does not cross itself nor have any other pathological behavior. 


134 


REVIEW OF COMPLEX VARIABLES 



Figure 5-8. — Analytic continuation along a curve. 


The collection of functions generated by the process of analytic continua- 
tion can be used, as in the case of direct analytic continuations, to define a 
new analytic function on the larger domain which includes all the domains 
on which the analytic continuations are defined. If, in addition, the function 
W\ (z) is analytically continued along a curve T, the values of this extended 
function on T itself will be the same no matter which specific collection of 
domains D z , Z) 3 , ... is used to construct it. 

5.7.2 Specific Method 

In order to make these ideas more concrete we shall consider a specific 
process which can be used, at least in theory, to obtain an analytic continua- 
tion of any given function. Thus, suppose that the analytic function W\{z) is 
defined by some expression in the domain Du For example, it may be defined 
by a Taylor series 


wAz) — ^ a„(z — z\) n 

n = 0 


(5-16) 


in which case the domain Di will be the interior of a circle of radius R x centered 
at the point z\. We suppose that R\ is finite. Choose a point Z 2 in the domain Du 
Since an analytic function is infinitely differentiable, we can calculate the 
sequence of derivatives w[ n) (z 2 ) for n = 0, 1, 2, . . . from the given expression 
for wi(z) in the domain Du For example, when the function W\(z) is given by 
the Taylor series (5-16), w ( f ) {z z ) can be obtained by differentiating equation 
(5-15), term by term, n times and evaluating the result atZ 2 - Then as indicated 
in section 5.5 the Taylor series 
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w 2 (z) 


2 

n= 0 


b„(z — z 2 ) n 


with 


b 




for n = 0, 1, 2, . . . 


will converge to an analytic function in a circle of nonzero radius centered at 
z 2 . And this function will be equal to Wi(z ) at every point which is inside both 
this circle and the domain D t . Of course, the circle of convergence of w 2 {z) 
may not extend beyond the domain D i (as shown in fig. 5 — 9(a)). If this occurs, 




(a) Circle of convergence of 2^(2) does not extend beyond D\. 

(b) Circle of convergence of 2^2(2) extends beyond D\. 


FIGURE 5 - 9 . — Analytic continuation by power series. 
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we can choose a new point zi and repeat the process. However, if the circle 
of convergence of w 2 {z) does extend beyond D\ (as shown in fig. 5-9(b)), 
then w 2 (z) will be an analytic continuation of uq(z). The process can now be 
repeated by choosing a point Z3 which lies within the circle of convergence of 
W 2 (z), proceeding to obtain a new Taylor series about this point, and so on. 

It can be shown that any analytic continuation of a given function, no 
matter how it has been obtained, can also be found by using the method of 
power series just described. It is easy to verify from this that the analytic 
continuation of the derivative of an analytic function has the same value at 
any given point as the derivative of the analytic continuation at that point, 
provided they are both carried out along the same curve. This means that the 
order of differentiation and analytic continuation can be interchanged. 

5.7.3 Singular Points 

Let w(z) be analytic at all points of a domain D except for a certain 
number of isolated singular points. Suppose, in addition, that w(z) is analytic 
on a subdomain D t of D and that Wi(z) is the restriction 61 of w(z) to Du Then 
Wi{z) is analytic on Di and can be analytically continued to any other sub- 
domain of D which does not contain singular points of w{z). As long as the 
analytic continuation of Wi(z) is carried out along a path which lies entirely 
within D, the value of this analytic continuation at any point of D will be equal 
to the value of w(z) at that point. However, since the circle of convergence of 
a power-series expansion of an analytic function will pass through its nearest 
isolated singular point (provided that point is nearer than the boundary of the 
domain), it can be seen by using the method of power series that the function 
w x (z) cannot be analytically continued along any curve which passes through 
an isolated singular point of w(z). (These ideas are illustrated in fig. 5-10.) 

More generally, let Wi(z) be analytic on some domain Du If this function 
cannot be analytically continued along any simple curve which crosses the 
boundary of Di at the point Zo, we say that the point Zo is a singular point of 
the function Wi{z). And the preceding remarks show that this definition is 
consistent with the definition of an isolated singular point given in section 5.4. 

00 

Let ^ a„(z — z 0 ) n be a power series whose radius of convergence is not 

n= 0 


61 That is, w\{z ) is a function which is defined only on D x and takes on the same values at each point of D x as the 
function w(z ). 
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X Isolated singular points 


Analytic continuation 
of wx<z) along this 


^ Analytic continuation 
/ cannot be carried out 



Figure 5-10. —Illustration of analytic continuation of a restriction of an analytic function. 



FIGURE 5-11. — Analytic continuation past a circle of convergence. 
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equal to zero or infinity. We have seen that this series converges to an analytic 
function w(z) everywhere within its circle of convergence and diverges outside 
this circle. We shall now show that this circle always passes through a singular 
point 62 of w(z). In order to obtain a contradiction, suppose that there were no 
singularities of w(z) on the circle of convergence. Then it would be possible to 
analytically continue w(z ) a finite distance outside this circle everywhere 
around its circumference (as shown in fig. 5-11). These analytic continuations 
could then be used to construct an analytic function u>i(z) which is an ex- 
tension of w{z) to a larger circle C which is also centered at £ 0 . Now it is shown 
in section 5.5 that the circle of convergence of the Taylor series expansion 
of W\{z) about zo cannot be smaller than C. But it is also shown in that section 
that this extended function wi(z) must have the same Taylor series expansion 
about z 0 as w(z). However, this is impossible since (by hypothesis) this latter 
series diverges outside of the smaller circle. Hence, we must conclude that 
there is a singular point of w(z) on its circle of convergence. 

Starting with a given analytic function Wi{z) defined on a domain D 1 , we 
can carry out the process of analytic continuation until all possible analytic 
continuations of the function W\ (z) have been found. The collection of analytic 
functions generated in this manner can again be used to define a new function 
on the domain which consists of all the domains on which these various func- 
tions are defined. The function obtained in this manner is called a complete 
analytic function. This function cannot be further extended. A point zo is said 
to be a singular point of the complete analytic function if it is a singular point 
of any analytic continuation of Wi(z). And sometimes, when no confusion is 
likely to arise, we shall say that zo is a singular point of the original function 
w\(z) itself. 


5 . 7.4 Multiple-Valued Functions 

There is a certain difficulty associated with the definition of a complete 
analytic function given in the preceding section. Thus, suppose that the 
analytic function w\ (z) defined on the domain D\ can be analytically continued 
along the two simple curves Ti and T 2 which terminate at the same point p, 


62 In section 5.5 we only asserted that the radius of convergence does not exceed the distance between z 0 and the 
nearest isolated singular point. 
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as shown in figure 5-12. There is no guarantee that the analytic continuation 
of w>i(z) along Ti will have the same value at p as its analytic continuation 
along r 2 . But if this occurs, the complete analytic function obtained from 
Wi (z) must have more than one value at the point p. Thus, although up to this 
point we have assumed that all functions are single valued, we must in general 
allow a complete analytic function to be multiple valued. It can be shown 
(ref. 19, p. 218) that the analytic continuation along the path Ti will always 
have the same value at the point p as the analytic continuation along T 2 unless 
there is a singular point (such as that shown in fig. 5-12) between these two 
curves. 63 However, the mere existence of a singular point between the two 
curves does not guarantee that the analytic continuations along the two dif- 
ferent curves will have different values at p. This only occurs when the singular 
point is a branch point. A branch point is a singular point of a function which 
has the property that the function will not return to its starting value upon 
analytic continuation along any arbitrarily small circle which surrounds 
this point. 

Suppose that the single-valued analytic function Wi(z) is defined on a 
subdomain Z) x of a domain D (see fig. 5-13). And suppose that D neither con- 
tains any singular points of u>i (z) nor is it possible to construct a closed curve 
within D which surrounds a branch point W\ (z) . Then it is impossible for any 
two analytic continuations of wi (z) along paths which lie entirely within D to 
have different values at any given point p of D. Now, in terms of these analytic 
continuations, we can construct on D (in the manner described above) an 
analytic function w (z). Then this function will be single valued. It is an exten- 
sion of w i (z) from the domain D\ to the larger domain D. And the fundamental 
theorem shows that w (z) is the only single-valued analytic function with this 
property. Therefore, when no confusion is likely to arise, we do not distin- 
guish between the two functions w(z) and wi(z) and we simply say that Wi(z ) 
is defined on the larger domain D. 

For example, consider the function Wi(z) defined by 

wi (z) =z 1/2 (5-17) 


83 This result is known as the monodromy theorem . 
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FIGURE 5— 14. — Analytic continuation of function z 1/2 . 


on a domain D\ which includes a portion of the real axis but which includes 
neither the origin nor any portion of the negative real axis, as shown in figure 
5-14. It is easy to verify that equation (5-17) represents a single-valued analytic 
function in the domain D\. In order to proceed it is convenient to introduce 
the polar representation z=re ie discussed in section 5.1. Then the function 
(5-17) can be written as 64 


wi(re ie ) = r 1/2 e W2 (5-18) 

When the point z is in D \ , the argument 6 will always lie in the range 

— TT < 6 < TT (5-19) 

(Notice that strict inequality signs are used.) Since the formula (5-18) with 
an extended range of 6 determines an analytic function at each finite point of 
the complex plane except r=0 and since this function coincides with wi(z ) 
in D i, we can use this formula to analytically continue W\(z) outside of D u 

64 Recall that we have adopted the convention that the square root of a positive real number is always the positive 
square root. 


142 


REVIEW OF COMPLEX VARIABLES 


Now the origin is a singluar point of Wi(z). Hence, suppose we analytically 
continue this function to the point p along I\, the semicircle with radius R 
in the upper half plane shown in figure 5-14. Then the value of this analytic 
continuation at p is /Ji/ 2 e « 0 »V 2 ) = R ll2 [ cos (7t/ 2) -f i sin ( tt / 2 )] = But the 

value at p of the analytic continuation along the semicircle 12 is /?^ 2 e _l,7r/2) = 
R ll2 [ cos ( — 7r/2) + i sin ( — 7r/2)] = — iR 112 - These analytic continuations of 
the function W\ (2) therefore have different values at the point p. 

If, instead of stopping at the point p, we carry out the analytic continuation 
of W\(z) first along Tj to the point p and then along T2 from the point p to the 
point q (in the direction opposite to the arrows), we arrive at the value 
R 1 l 2 e i2rrl2 =—R 112 . But since the original value of u>i(z ) at the point q is R ll2 e i0 = 
R 112 , we see that the function does not return to its original value upon ana- 
lytic continuation around this circle. And since the radius R is arbitrary, this 
shows that the origin is a branch point of W\{z). By making the transformation 
z=l/u; and taking R arbitrarily large, we can also show that the point at 
infinity is a branch point of this function. And since z — 0 and z=°° are the 
only singular points (and therefore the only branch points) of W\ (z), this 
function will always return to its original value when it is analytically continued 
around any path which does not enclose the origin. 65 

Now every analytic continuation of w 1 (z) can be obtained from the formula 
(5-18) by letting r range between zero and infinity and letting 6 take on all 
values both positive and negative. But, since e 2in7r = 1 for n = 0, ± 1, ±2, . . ., 
we need only consider values of 6 in the range 0 o s£ 0<0o + 4tt (where 6o 
can be chosen as any fixed number) in order to obtain all possible values of the 
function (5-18). Thus, the complete analytic function w{z) obtained from 
Wi(z) is the multiple-valued function defined by 

w(re i9 ) = r 1/2 e W2 0 r ^ cc; $ 0 ^ ^ 0o + 47 r (5—20) 

It takes on two distinct values at each finite point of the complex plane except 
at the origin, which is a branch point. And one of these two values is equal to 
the negative of the other. 


65 It will never return to its original value when analytically continued once around any path which does enclose 
the origin. 
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y 


(b) 

(a) Branch cut in arbitrary location. 

(b) Branch cut along negative real axis. 

FIGURE 5-15. — Branch cuts for z 1/2 . 


It is usually undesirable to deal with multiple-valued functions. 66 We can 
avoid doing this by drawing a line connecting the branch point of the function 
z 1 ^ at the origin with its branch point at infinity, as shown in figure 5-15(a), 
and then restricting the analytic continuations so that they are not carried out 
along any path which crosses this line. Such a line is called a branch cut and 
it is used to prevent analytic continuations from being carried out along curves 
which encircle the origin. The actual location of the branch cut is arbitrary but 
we may, lor definiteness, assume that it lies along the negative real axis, as 
shown in figure 5-15(b). Then starting with the original function (5-17) (which, 
as can be seen from eqs. (5-18) and (5-19), is positive along the positive real 
axis) and analytically continuing this function along all allowable paths in the 
complex plane, we obtain the extended function 


“After all, we would not expect a well-defined physical problem to have a solution which is multiple valued. 
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Wi ( re ie ) = r ll2 e i012 Os£rs£co ; -7j-sS0*£7r 

which is single valued and analytic at every point of the complex plane not lying 
on the negative real axis. We can also analytically continue the function which 
is equal to the negative of the original function (5-17) in D\ along all allowable 
paths in the cut plane to obtain the extended function 


Wi(re id ) = r ll2 e' ei2 0 < r < °°; w < 6 < 3n 

And this function is also single valued and analytic at every point of the complex 
plane not lying on the negative real axis. The functions W\ and Wi are said to 
be branches of the double- valued function (5-20). Taken together these two 
branches assume all the values of the multiple-valued function and are there- 
fore equivalent to it. Hence, we can deal with a multiple- valued function by 
replacing it with its single- valued branches. 

The complete function in this example is double valued and therefore 
has two branches. However, we also encounter multiple-valued functions 
which take on infinitely many values at each point and therefore have infinitely 
many branches. For example, the function w(z) = In z is defined in polar 
notation to be 


w(re ie ) = In re i0 = In r+ id 

In order to obtain the complete analytic function we must (as in the preceding 
example) let r take on all values in the range 0 < r < oo and 6 all real values. 
But since z= re i0 — re i{0+2nJT) if, and only if, n = 0, ±1, ±2, . . ., this complete 
function must have infinitely many values at each point. And these values 
differ from one another by multiples of 2ni. This function also has a branch 
point at the origin and a branch point at infinity. And if the branch cut is 
again taken along the negative real axis, the infinitely many branches w n (z ) 
for n — 0, ±1,±2, . . . of In z become 

w n {re i0 ) = In r+ if? 0 < r < a>; (2n — 1) tt < 6 < (2n + 1) tt 

for n = 0, ± 1, ± 2, . . . 


The branch corresponding to the range — tt<6<tt is called the principal 
branch of the logarithm. 
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The function z a (where a is some complex constant) can be defined in terms 
of the logarithm by the formula z a =e alnz . Hence, the branch points of this 
function can be located only at the origin and at infinity. By using polar notation 
we can express this function in the form z a =e alnr e“ ie =r"e l “ e . And this shows 
that the complete function is multiple valued unless a is an integer. In fact, 
it takes on infinitely many values at each point unless a is a rational number. 
The various branches of this function can be formed in the same way as for the 
logarithm. 

In a similar manner, it can be seen that for any finite point z 0 the complete 
analytic function associated with (z — zo) a is multiple valued whenever a is 
not an integer. Its branch points are z = zo and z = °° and its branch cut can be 
taken along any line joining these two points. However, once a branch cut has 
been chosen, the various branches of this function will then be analytic every- 
where in the cut plane. 

5.8 PERMANENCE OF FUNCTIONAL RELATIONS 

Let F(zi, . . ., z„) be a complex function of the n complex variables 
zi = Xi + iy\, . . ., z n = Xn+iy n ■ If this function can be expanded in a power 
series 


F(zi, . . .,z»)= a h , . . .,a in zjiz|2 . . . zj» 

it, • • -,in= 1 


with complex coefficients a*,, . . ., a,-„ and if this series converges in some 
neighborhood of each point in some domain of the 2rc-dimensional space whose 
coordinates are xt, . . ., x n , yi, . . ., y n , then we say that F is an analytic 
function of the n complex variables z i, . . . , z n in D. This is clearly an exten- 
sion of the definition of an analytic function of a single complex variable given 
in section 5.2. 

Now let F(wi, . . ., w n , z) be an analytic function of the n+1 complex 
variables W\, . . ., w n , z for all values of the variables W\, . . ., w n and for 
all values of z in some domain D 0 . If u>i(z), . . ., w n (z) are analytic functions 
of the complex variable z (in the usual sense) in some common subdomain 
D of Do, then it can be shown (ref. 4) that the function g(z ) defined by 
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-Path of analytic 
continuation of 
the functions 
w 1 (z),...,w n (z) 


Figure 5-16. — Illustration of permanence of functional identities. 


g(z) = F (wi(z) , . . . , Wn(z) , z) 


is also analytic in D. 

Now suppose that icj(z), . . w n (z ) are analytic continuations of 

Wi(z), . . . , w n (z), respectively, from D to some other common subdomain 
D of Do and these analytic continuations can all be obtained by analytically 
continuing the functions Wt(z), . . ., w n (z ) along a single curve T which 
lies entirely within Do. (This is illustrated in fig. 5-16.) In addition, suppose 
that W\ (z) , . . . , Wz (z) satisfy the equation 

F(wi, . . .,iv n ,z) = 0 (5-21) 

at all points z in D. Then by analytically continuing these functions along T to 
D and using the fundamental theorem of analytic continuation given in the 
beginning of section 5.7, it can be shown (ref. 19, p. 210) that u>i(z), . . ., 
w„(z) also satisfy equation (5-21) at every point of D. This is known as the 
principle of permanence of functional relations. Roughly speaking, it means 
that the analytic continuations of the solutions of equation (5-21) are also 
solutions of this equation. 
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5.9 DIFFERENTIAL EQUATIONS IN COMPLEX PLANE 


5 . 9.1 Definition 

We have already indicated the utility of extending the definition of a 
differential equation to include the case where the variables are complex. To 
this end let F(wi, . . ., w n ,z) be an analytic function of the n+ 1 complex vari- 
ables Wi, . . . , w n , z in some domain D. Then the nth-order normal differential 
equation in the complex domain is an equation of the form 


d n w „ ( dw d n l w \ 


(5-22) 


Notice that in writing this equation we imply that its solutions, if they exist, 
must possess complex derivatives at all points where they are defined. It is 
also reasonable to require that the solutions satisfy the equation at least on some 
domain in thez-plane. Hence, the solutions to equation (5-22) must be analytic 
functions. This is a much stronger restriction than is imposed in the case of real 
variables, where we require only that the solutions be sufficiently differentiable. 


5 . 9.2 Fundamental Theorem 

The following fundamental theorem (which is analogous to that given in 
chapter 1 for the real- variable case) can be shown to hold (ref. 4, p. 119). For 
each point 67 £i, . . ., £ n , zo of the domain D where the function F is analytic, 
there exists a unique ( i.e single-valued) function w{z) which satisfies the initial 
conditions 

w(zo) = £l, tv' (z 0 ) = . . ., M/ n- 1 ) Cz 0 ) ={ n 

is analytic, and satisfies equation (5-22) in some neighborhood of the point zo. 
We shall be principally interested in the (effectively normal) linear equation 

^•+ fli(z) + • • • +a»_i(z) ^+a„(z)w+b(z) =0 (5-23) 

where the coefficients a t (z) , . . . , a„ (z) , b (z) are all analytic on some common 
domain Do. This equation is effectively of the form (5-22) with the function F 


67 Ci, ■ - ■, £n, zo are n + 1 complex numbers. 
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analytic on the domain D which consists of all the values of the variables 
Wi, . . ., W 2 and all the values of the variable z which lie in the domain Do. 
Hence, the fundamental theorem now becomes: For each set of complex 
numbers £j, . . £„ and each point 20 of Do there exists a unique function 

w(z) which satisfies the initial conditions 


w(zo) = Cl,w'(zo)—C2, • • w (n ^(zo)— £n 


is analytic , and satisfies equation (5—23) in some neighborhood of the point zo. 

There is also, in this case, an additional result, due to Fuchs (ref. 24, p. 4) 
which asserts that this solution, w(z), has a Taylor series expansion about zo 
whose radius of convergence is at least equal to the shortest distance between 
zo and the boundary of Do. 

Now suppose that w(z) is a solution to equation (5-23) on some subdomain 
D\ of Do. Then the (single valued) function w(z) and all its derivatives are 
analytic on Du Let w(z) be an analytic continuation of w(z) along some curve 
in Do to some other subdomain D of Do, as shown in figure 5-17. Then since, 
as indicated in section 5.7.2, the analytic continuation of a derivative of an 
analytic function is equal to the derivative of the analytic continuation, we can 
apply the principle of permanence of functional relations to equation (5-23) 



Figure 5-17. — Illustration of analytic continuation of solution to differential equation. 
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to show that the analytic continuation w(z) is itself a solution to equation 
(5-23). Thus, the analytic continuation along any curve lying entirely within 
Do of a solution to equation ( 5-23 ) is also a solution of this equation. 

For example, the differential equation 


w 


" -\-~7~zW — 0 


6z 


6z 2 


has the solution 


w(z) = Z l/2 + Z l/3 


But upon analytic continuation of this solution around any closed path en- 
circling the origin, we obtain the function 

Wl ( z ) =(e 2lTi z) 1 1 2 + (e 27ri z) 113 = - z 1/2 + e 27ril3 z m 

And it is easy to verify by direct substitution that W\ (z) is also a solution of the 
equation. (In fact, w(z) and W\ (z) are linearly independent.) 

We shall now show that any solution to equation (5-23) which is defined 
on a subdomain of D 0 can indeed be analytically continued along any curve in 
Do. This means that no solution to equation ( 5-23 ) can have a singular point 
in Do. The assertion can be proved by assuming that there exists a solution 
w(z) in a subdomain Di of D 0 which cannot be analytically continued along 
some curve T in Do and then showing that this leads to a contradiction. Thus, 
if w(z) cannot be continued along T, there must be a point Z\ on T as shown in 
figure 5-18 such that w(z) can be continued up to, but not past, this point. We 
can therefore choose a point z 2 on the portion of T joining D\ to z t which is closer 
to Zi than any part of the boundary of D 0 . Upon analytically continuing the 
solution w(z ) along T to z 2 , we obtain a solution w 2 (z) of equation (5-23) 
in a neighborhood of z 2 . And Fuchs’ result shows that w 2 (z ) can be expanded 
in a Taylor series about z 2 that converges in a circle which includes the point 
Zi. But this constitutes an analytic continuation of w(z) along F past the point 
Z\, which was assumed to be impossible. And this proves the assertion. 

It follows from these results that any solution 68 to equation (5-23) in 


68 The existence of such a solution is asserted by the fundamental theorem but only in some neighborhood of this 
point. And this neighborhood could be very small. 
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Figure 5-18. — Continuation of w{z) along T. 


a neighborhood of a point where the coefficients are analytic can actually be 
extended to obtain a solution to this equation on the entire domain Do on which 
the coefficients are analytic. However, in order to do this we must, in general, 
allow these solutions to be multiple valued. 


5.9.3 Linearly Independent Solutions 

By using the fundamental theorem given in this section, the various results 
given in section 1.6 for linear equations with real variables can be extended to 
the complex-variable case. Thus, the homogeneous equation associated with 
equation (5-23) 


d^w f ^ d^ j | / \ ^ i c o/i \ 

~d^ +ai{z) ~d^ + • • ' +a -^ w -° t 5 " 24 ) 

possesses n linearly independent (single valued) solutions w\{z ) , . . .,w n (z), 
called a fundamental set of solutions, in a neighborhood of each point of 
D 0 . Now let w p (z ) be a particular (single valued) solution of equation (5-23) 
in the neighborhood of some point of Do. Then every solution of this equation 
about this point can be obtained by making a suitable choice of the arbitrary 
constants Ci, . . ., c n in the general solution w(z) = CiWi(z) + . . . +c n tc n (z) + 
w p (z ) of this equation. 
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The definition and discussion of linear independence given in section 1.6 
applies with only trivial modification to the case where the functions are com- 
plex and analytic. Thus, in particular, a necessary and sufficient condition 
(ref. 24, p. 13) that n (single valued) solutions ici(z) , . . ., w n {z) of equation 
(5-24) be linearly independent on a domain Do on which the coefficients are 
analytic is that the Wronskian 


W\ W2 ... W n 

tv'i IV2 ... w' n 


fV(wi, . . .,«;„) = 


(5-25) 


w [n-l) W (n-X) 


W 


(n-1) 

n 


not be equal to zero at any point Z\ of D 0 . Since w\, . . ., w„ are analytic func- 
tions of z, the Wronskian itself is an analytic function of z which we shall denote 
by W(z). Thus, 

r(z)*rw:) Mz)) 

It is also easy to see that the Wronskian of the analytic continuations of 
w\ (z) , . . . , w n (z) along a curve T in Do is equal to the analytic continuation of 
the function W (z) along T. 

It can be shown by using the rules for differentiating determinants and by 
substituting in the differential equation (5-24) (ref. 24, p. 12) that W satisfies 
the first-order differential equation 


<W 

dz 


P\{z)W 


(5-26) 


And upon separating the variables and integrating along any path T in Do, 
we obtain 

?T(z) = ce (5-27) 
where c is a complex constant of integration. 
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Since pi(z) is analytic in Do, its integral J pi(z)dz must also be analytic 

in Do. Hence, in particular, this integral cannot become infinite at any point of 
D 0 ; and therefore the exponential factor in equation (5-27) can never vanish 
in D 0 . Thus, W{z) can only be equal to zero at a point Z\ of Do if c = 0. And 
this shows that, if W'iz) vanishes at any point of Do, it must vanish at every point 
of Do- 

In the special case where n — 2, equation (5-27) can be used to obtain an 
explicit formula which determines a second linearly independent solution 
w 2 (z ) to equation (5-24) when one solution uh(z) to this equation is known. 
Thus, when n = 2, we find upon expanding the determinant and rearranging 
that 

w? -y- ( —] = W(z) = ce-l p ' iz) * 

dz \ w\ / 

And integrating this along any curve in Do yields the formula 

f e ~f 

w 2 (z) = cwi(z) J dz (5-28) 

which agrees with the formula obtained by the method of variation of parame- 
ters in section 4.1. It follows from the way in which it was constructed that any 
solution w 2 {z) calculated from this formula will be linearly independent of 

wi(z). 

More generally, it can be shown (ref. 24, p. 16, example 10) that, if n — 1 
linearly independent solutions, say wi(z) , . . ., w n -\{z) , to equation (5-24) 
are known, another linearly independent solution to this equation is given by 


n - 1 f g-jpiM* 

W n = c ^Wi(z) J Mi(z)dz 


where W \ (z) is the Wronskian 
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Wfz) = W(Wi, . . ., W n - 1 ) = 


Wi 

W 2 ... 

tVn - 1 

w[ 

W 2 

W ' 
n - 1 

w (n-2) 

m4”- 2) . . . 

jij( n ~ 2 ) 

w n- 1 


and, for £=1,2, . . n — 1, Mj(z) is the cofactor of w ? -2 in this Wronskian. 
That is. 


Mi(z) 


dW_ 

dw v- 1 


5.10 NONELEMENTARY TRANSCENDENTAL FUNCTIONS 

We shall have occasion to use two particular nonelementary functions 
of a complex variable called the gamma function and the beta function. First, 
we define the analytic function T(z) for^z > 0 to be the Eulerian integral 
of the second kind. 



-H z ~ l dt 


(5-29) 


This integral converges in the right half plane ^ z > 0 and diverges for 
z^O. It can be shown (ref. 25) that it represents an analytic function in its 
domain of convergence. Although this analytic function is only defined by 
equation (5-29) in the right half plane, it can be analytically continued into the 
left half plane OfL z *£ 0 by using the formula 


r(z)T(l -z) =^— (5-30) 

Sin 7TZ 

to compute the values of T(z) for z *£ 0 from its values at points in the 
right half plane. It follows from this equation that T(z) has simple poles at 
z = 0 , - 1 , — 2 , . . . 

Integrating equation (5-29) by parts (with z replaced by z+ 1) shows 
that for z> 0 
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roc 

r(z+l) = zl e~H z ~ x dt 

Hence, 

r(z+l) =zT(z) ( 5 - 31 ) 

By successively applying equation (5-31) we find that for any positive 
integer n 

T(z + n) = (z + n — l)T(z + n — 1) 

= (z+n — 1) (z + n — 2)T(z + ra — 2) 


= (z + n — 1) (z + n — 2) . . . (z+l)zl"(z) (5-32) 

It is convenient to introduce a special notation for the factor multiplying 
T(z) in the last member of this equation. Hence, we define the generalized 
factorial function (z) n by 


and 


ft — 1 

(z)n= n (z+m) 

m=0 


— z(z+ 1) (z + 2) 


w«=i 


(5-33) 


(z+n — 1) for n= 1,2, 3, . 


Thus, the symbol (z) n denotes the product of n factors, each factor being 
one larger than the preceding one. For example, 
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Notice that when z = 1 in equation (5-33) , we obtain the ordinary factorial 
function since 

(1)„= 1 -2-3 . . . n = n\ (5-34) 

By using this notation, equation (5-32) can be rewritten as T(z+ n) = (z)nT(z). 
And, therefore, the generalized factorial function can be expressed in terms 
of the gamma function by 

(2)n = n fX^ for n= 1,2, 3, . . . (5-35) 

Since integrating equation (5-29) with z= 1 shows that T(l) = 1, we find 
from equations (5-34) and (5-35) that 

r(/i+l) = /i! for 7i= 1,2, 3, .. . (5-36) 

This equation is also valid for n = 0 provided we use the usual definition 0!= 1. 

The beta function B{z, £) is a function of two complex variables and is 
defined by 


fi(z, {) = f 


(5-37) 


for z > 0 and ^ £ > 0. However, it is possible to express this function 
in terms of the gamma function since 


e~ r T^~ l dr 


■WU) - ( JT ) ( l 

= e- T r l-'drjdt 


) 


And upon setting x = r/t, we can eliminate r from this equation to obtain 

r(z)r({) 


f e~‘t z ~ 1 t i ( 

f e~ tx x^~ l dx\ 

Jo \ 

Jo J 
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Hence, after changing the order of integration we find that 

r(z)r(£)= J o dx 

Then by defining T\ by ri = t(x + 1) and eliminating t, we get 

ro ° rt- 1 

’ ' e-’iT'+l-'dn 

dx 


f°° rt - 1 r 

r « r( »=J. ITT 

r x 1 - 1 
- r(2 + »i. ITTTT 


(l+*)«+* 

And after setting x=t/(l — t) this becomes 

r(z)r(o = r(z+o JV 

But comparing this with equation (5-32) shows that 

mi 


l dt 


B(£,z) 


r (z+o 


(5-38) 
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CHAPTER 6 


Solution of Linear Second-Order Differential 
Equations in the Complex Plane 

In section 5.9 we extended the definition of a differential equation to 
include the case where the variables are complex. In this chapter the ideas 
presented in that section are used to find the behavior of and, in certain 
cases, to construct solutions to the second-order differential equation 

§+pwf + «W» = ° (6-1) 

where z is a complex variable and the coefficients p(z) and q(z) are analytic 
functions in some domain D of the z- plane. We have seen that every solution 
of this equation is an analytic function of z. 

A point at which the coefficients of equation (6-1) are both analytic is 
called an ordinary point of this equation. Thus, every point of D is an ordinary 
point. A point z o which is a singular point of either p{z) or q(z) (or both) is 
said to be a singular point 69 of equation (6-1). And if the only singularities of 
p(z) and q(z) which occur at Zo are isolated singular points, zo is also said 
to be an isolated singular point of equation (6-1). 

6.1 GENERAL BEHAVIOR OF SOLUTIONS AT ORDINARY POINTS 

Let zi be any point of D. It was indicated in section 5.9 that equation 
(6-1) will possess two linearly independent (single valued) solutions, w\(z) 
and io 2 (z), in some neighborhood of Z\. These solutions are said to be a funda- 
mental set of solutions. They possess Taylor series expansions about zi, say 


69 Notice that this definition is consistent with the one given in section 1.5. 
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Wi{z) a{ n } (z — Zi) n 

n-0 I 

( 6 - 2 ) 

m (z) = j? a ( „ 2 >(z-zi) n ] 

71=0 ' 

whose radii of convergence are at least equal to the shortest distance between 
zi and the boundary of D. Since the coefficients of equation (6-1) can always 
be analytically continued across those points on the boundary of D which 
are not singular points of the equation, it is always possible to extend the 
domain D (in which the coefficients are analytic) in such a way that every 
finite point on its boundary 70 is a singular point of equation (6-1). Hence, we 
can assert that the radii of convergence of the Taylor series expansions (6-2) 
of the solutions W\ (z) and w 2 (z) tire at least equal to the distance Ro between 
z\ and the nearest singular point of equation (6-1). It was also shown in 
section 5.9 that the solutions wfz) and w 2 (z) can be analytically continued 
along any simple curve in D and that these analytic continuations are them- 
selves solutions of equation (6-1). Thus, in particular, we can assert that 
the series (6-2) not only converge within a circle of radius Ro but that they 
also satisfy the differential equation everywhere within this circle. 

6.2 GENERAL BEHAVIOR OF SOLUTIONS NEAR ISOLATED SINGULAR POINT 

Suppose that z 0 is either an isolated singular point or an ordinary point 
of equation (6-1). Then the coefficients p(z) and q{z) of this equation will be 
analytic within the punctured circular domain 0 < | z — Zo | < R (shown in 
fig. 6 _ 1) whose radius R is equal to the distance between zo and the singular 
point 71 of equation (6-1) closest to z 0 . We shall call this domain 72 A. Now 
let Wi(z) and w 2 (z) be a fundamental set of solutions to equation (6-1) in 
a neighborhood Do of a point z of A. We have seen in section 5.9 that these 
solutions can be extended (by using their analytic continuations) so that 
they satisfy equation (6-1) at every point of A. However, since z 0 may be 
a singular point and since it can be encircled by a curve such as the curve 
T shown in figure 6—1, these extended solutions can be multiple valued. 
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70 Provided D has a boundary. 

71 Other than z 0 itself. 

72 Notice that zo is not a point of A. 
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Figure 6- L — Contour for analytic continuation of w\ and w 2 . 


In order to deduce the behavior of these extended solutions, we shall 
now construct a fundamental set of solutions on A whose structure is particu- 
larly transparent. To this end let the fundamental set W\(z ) and w 2 {z) be 
analytically continued from Do counterclockwise around I\ This process will, 
in general, yield two new functions of z, say W x (z) and W 2 (z) which are defined 
on D 0 by 


W,(z) =w i(e 2nt (z — Zo) +zo) 
W 2 {z) =w 2 (e 27ri (z — zo) +zo) . 


( 6 - 3 ) 


But we know that lP\(z) and W 2 (z) must satisfy the differential equation (6-1) 
and that every solution to this equation in Do can be expressed as a linear 
combination of the fundamental set of solutions Wj(z) and w 2 (z). Thus, there 
exist complex constants an, a i 2 , a 2 i, a 22 such that 


( 6 - 4 ) 


W 1 (:) = flii!Oi(z) + ai 2 MJ2(z) 

r 2 (z) = a 2 iu;i(z) + a22^2(z) 
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For example, consider the differential equation 73 

w' +2^2 w = 0 
02 6z 2 

and let zo = 0. A fundamental set of solutions to this equation is w>i = z 1/2 and 
w 2 = z 113 . Upon applying equations (6-3) to these solutions, we find that 

W \{z) —W\ (e 27 ri z) — (e 2 ni z) ll2 = (— l)w;i(z) 

and 

W 2 {z) = e 2 iri|, 3 z ^ 3 = e 27 ri/ 3 w 2 (z) 


And, therefore, the constants which appear in equations (6-4) become, in this 
case, aj i = — 1 , a !2 = 0 , a 2 1 = 0 , an = e 2nil3 . 

We shall now show (in the general case) how the constants in equations 
(6-4) can be used to construct the desired fundamental set of solutions to 
equation (6-1) on the domain A. In order to do this, we first recall from the 
theory of linear equations that the algebraic equations 


(an — \)ci + a 2 \c 2 — 0 
ai2Ci + (a 2 2 — A.)c 2 = 0 . 


(6-5) 


have a nontrivial solution 74 in Ci, c 2 , provided X is a root of the characteristic 
equation 


= \ 2 — (an + a 22 )\ + (anU22 — ai202i) = 0 ( 6 - 6 ) 

This equation will either have two distinct roots or two equal roots. We shall 
treat these two cases separately. 


flu X a 2 j 
ai 2 a 22 — X 
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73 This equation was discussed in the example given in section 5.9.2. 

74 That is, a solution other than c\ = C 2 — 0. 
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6 . 2.1 Case I: Roots of Characteristic Equation Distinct 

First, suppose that the two roots of equation (6-6), Xi and X 2 , are distinct. 
Let 4 X >, 4 1 * be a set of nontrivial solutions to equations (6-5) corresponding 
to the root Xi and 4 2) , 4 2) be a set of nontrivial solutions corresponding to 
the root X 2 . Thus, 


(«u — Xi)c^ x) + a 2 i4 x) = 0) 


ai 2 C ( j x) + (a 2 2 — Xj)4 x) = Oj 


(a n - X 2 )4 2 » + a 2 i4 2) = 0 

ai2C< 2) + (a22 — X2 )c (2) = 0 


(6-7a) 


(6- 7b) 


Now let Ui(z) and 1 * 2 ( 2 ) be the solutions to equation (6-1) which are 
defined in D 0 (and only in Do) in terms of the fundamental set Wi(z ) and 
w >2 (z) by 


«i(z) = 4 x) wi(z) + c^w^z) (6-8) 

u 2 (z) = c^wiiz) + 4 2) u; 2 (z) (6-9) 

Then it follows from equations (6— 3), (6—4), and (6—8) that Ui(e 2m (z zq) + zo), 
the analytic continuation of t*i(z) counterclockwise around T, is given by 

Ui(e 2lTi (z — zo) + zo) = c^Wi(e 27ri (z — z 0 ) + z 0 ) + 4 x) M/ 2 (e 2lT '(-z — zo) + zo) 


= c^Wriz) + c^W 2 (z) 

— [4 !) «n + 4 1 >a 2 i]wi(z) + [c(, x) ai 2 + 4 1 >a 22 ]w 2 (z) 
Hence, equations (6-7a) and (6-8) imply 

ui(e 2ni (z — zo) +zo) = Xic^ x) m;i(z) + Ai4%i(z) = XiUi(z) (6-10) 
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And the same argument shows that the analytic continuation of the solution 
Ui(z) around T, U2(e 2,ri (z — z o) + 20), is 

U2 (e 2ni (z — zo) + 20) = A-2«2(z) (6-11) 

The constant Xi is nonzero. If this were not true, equation (6-10) would 
show that ui(e 2ni (z — 20) +20) is identically zero in D 0 . But then ui(e 2ni (z — 20) 
+ zo) could be analytically continued backwards along T to show that 1*1(2) 
is identically zero. And since the constants and c ^ are not both zero, 
equation (6-8) would show that Wi(z) and 1^2(2) are linearly dependent. But 
this is impossible since wi{z) and 1^2(2) are, by hypothesis, a fundamental 
set. Hence, we conclude that Ai ¥=■ 0. The same reasoning shows that X2 0. 
We can, therefore, define two (finite) numbers 81 and 82 by 75 

81 = -^—. In \i 82 = 7T~. In X 2 (6-12) 

2m 2m 

and use these numbers to define the two analytic functions f\{z) and/ 2 (2) in 
Do by 


f\ (2) = (2-2 0 )~ s mi(z) 
fl (2) = (2-2o)- 6 2U 2 (2) . 


(6-13) 


Since the right-hand members of these equations are products of functions 
which can be analytically continued along any simple curve in A, it follows that 
fi (2) and fj{z) must also have this property. And since 1^1(2) , **>2(2) , (2 — 2o)~ Sl 
and (z — Zo)~ Sl have no singular points in A, it follows from equations (6-8), 
(6-9), and (6-13) that /i(2) and fiiz) have no singular points in this region. 76 
Hence, these functions have no branch points in A. On the other hand, the 
analytic continuations fi{e 27ri (2— 20) +20) of fi(z) for t=l, 2 in a counterclock- 
wise direction around any curve F which encloses 20 are 


fi{e 2ni (z — 20 ) + 20 ) = (e 27Ti (z — zo))- Si Ui(e 27Ti (z — z 0 )+zo) for i— 1 , 2 


75 The condition Ai ^ implies that 61 — 62 is not an integer. 

76 However, the point z o may be a singular point of these functions. 
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Hence, it follows from equations (6-10) and (6-11) to ( 6 - 13 ) that 
fi(e 2rri (z — zo) + zo) =7- (z — zo)- Si Ui(e 2ni (z — z 0 )+zo) 

A i 


= (z — zo) ~ Si Ui (z) =fi ( z ) for i = 1 , 2 

But this shows that the point z<> is also not a branch point of the functions 
fi(z ) , i— 1 , 2. Hence, these functions will return to their original values when 
they are analytically continued around any closed curve in A. And this implies, 
as indicated in section 5 . 7 , that they can be extended to single-valued analytic 
functions on the entire domain A. We can therefore suppose that these ex- 
tensions have already been carried out and that /i(z) and f 2 (z) are defined on 
the entire domain A. Hence, it follows from equations ( 6 - 13 ) that the extensions 
of the solutions «i(z) and U2(z) from Do to the domain A (which we shall 
also denote by Ui(z) and u 2 (z)) are functions of the form 


ttl(z) = (z — Zo) 8 l/l(z) 
u 2 (z) = (z — z 0 ) s 2/ 2 (z) 


( 6 - 14 ) 


where /i(z) and f 2 (z ) are single-valued analytic functions on the entire domain 
A. (The point z 0, however, may be an isolated singular point of these func- 
tions.) And as indicated in section 5.8 the extended functions «i(z) and u 2 (z ) 
must satisfy the differential equation (6-1) everywhere within A. These are 
the solutions which we have set out to construct. We shall now show that 
they are linearly independent. To this end suppose that yi and y 2 are any 
two constants such that 


yitii(z) -\-y 2 u 2 {z) = 0 


Substituting equations (6—8) and ( 6 — 9 ) into this equation shows that 


[ yic\ 1} + y 2 c< 2) ] wi (z) + [ yic^ ] + y2C ( 2 2) ] w 2 (z ) = 0 
And since w x (z) and w 2 (z) are linearly independent, it follows that 


165 



DIFFERENTIAL EQUATIONS 


yic^ + y 2 c( 2) = 0 
■yic* 1 * + y 2 cf ) =0 


(6-15) 


Hence, upon multiplying equation (6-7 a) by y\ and equation (6-7b) by y 2 
and adding the results, we find that 


(X! — X 2 )yic) 1) = (A.! — \ 2 )y 2 c(> 2) = 0 


But since we are considering the case where Xi # X 2 , this implies that 


yic^ = y 2 c^ ) = y 2 c( 2) = yic^ = 0 


And by using the facts that and ci 2 l) are not both zero and c (2) and c ( 2 2) are 
not both zero, we can conclude that yi = y 2 = 0. But this shows that «i(z) 
and « 2 (z) are linearly independent. 

Before discussing the implications of these solutions, we shall first treat 
the case where the roots of the characteristic equation are equal. 


6.2.2 Case II: Roots of Characteristic Equation Equal 

Thus, suppose that the roots of equation (6-6) are equal. Then the argument 
used in the preceding section can easily be adapted to show that there still 
exists at least one solution to equation (6-1) of the form 


wi (z) = (z — zo) Sl /i (z) (6-16) 

where /i(z) is a single-valued analytic function in A and 8i=(l/27ri) In Xj 
with Xi 0. We can therefore assume that the fundamental set of solutions 
it>i (z) and 102 (z) of equation (6-1) in the domain D 0 (fig. 6-1) has been chosen 
in such a way that J 0 i(z) is given by equation (6-16). By analytically con- 
tinuing this solution around T we find that in this case the function W\ (z) 
(defined in eqs. (6-3)) is given by W 1 (z) — k 1 wi(z). And since Wi(z) and w 2 (z ) 
are linearly independent, the first equation (6-4) shows that 
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dl2 — 0 


and 


un — Ai 

However, these equations imply that the roots of the characteristic equation 
(6—6) are Xi = an#0 and k 2 = a 22 . But since we are considering the case 
where A.i = A. 2 , it follows that an = a 22 = Xi. Hence, in this case equations 
(6—4) become 


W i(z) = X!^i(z) 
lF 2 (z)=a 21 wi (z)-t-X.iW 2 (z) 


We now define the function f 2 (z) on the domain Do by 

f 2 (z) = (z — z 0 )~ Sl w 2 (z) — In (z — Zo) 

Ziti Ai 


(6-17) 


(6-18) 


This function can be analytically continued along any simple curve in A, and 
it will return to its original value when it is analytically continued along any 
closed curve which does not encircle zo. Its analytic continuation (counter- 
clockwise) along any curve T which encircles zo is 


f 2 (e 2ni (z — zo) +zo) 


(z— Zo) Si e - 2 mSifp r 2 


(z)- 


« 2 l/l(z) 

2mki 


In e 2ni (z — zo) 


(z— z p)~ Sl 

X, 


W 2 (z) 


«2l/l jz) 
2mki 


In (z-z 0 )-f i / 1 (z) 

A-X 


But inserting equation (6-16) and the second equation (6-17) into this relation 
shows that 
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/ 2 (e 2,ri (z — zo) +zo)— fz{z) 


Hence, the function f 2 (z) will return to its original value when it is analytically 
continued around any closed curve in A. It can therefore be extended to a 
single-valued analytic function on the entire domain (which may have an 
isolated singular point at zo)* And we can now conclude from equations (6—16) 
and (6-18) that the extension of the solution w 2 (z) from/) 0 to the entire domain 
A (which we shall also denote by w 2 (z) ) is a function of the form 

w 2 (z) = (z—z 0 ) Sl f 2 (z) + awi(z) In (z — z 0 ) (6-19) 

where f 2 (z) is a single-valued analytic function on the entire domain A and we 
have put a = a 2 i/27ri\i. It can again be shown that the solutions Wi(z) and 
w 2 (z) are linearly independent. 


6 . 2.3 General Conclusions 

Notice that the fundamental set of solutions (6-16) and (6-19) with a = 0 
are of the same form as the fundamental set (6-14) with 8i = 8 2 . The funda- 
mental set (6-2) which occurs at an ordinary point of equation (6-1) is a special 
case of the fundamental set (6-16) and (6-19). We have therefore established 
the following conclusion: Let zo be an ordinary point or an isolated singular 
point of equation ( 6-1 ) and let R be the distance between zo and the singu- 
lar point of equation ( 6-1 ) which is closest to zo* Then equation ( 6-1 ) possesses 
a fundamental set of solutions on the punctured circular region 0 < | z — zo | <R 
which is of the form 


wi(z)= (z — z 0 ) 8, /i(z) 

w 2 (z) = (z — zo) hl f 2 (z) + awi(z) In (z — z 0 ) . 


( 6 - 20 ) 


where fi(z) and f 2 (z) are analytic single-valued functions in 0< \z — zo| <R 
and a, 8i, and 8 2 are complex constants. 

The fundamental set (6—20) is called a canonical basis. The constant a 
is equal to zero whenever 8i — 8 2 0, ± 1, ±2, . . .. The case where 8j — 8 2 

= 0, ±1, ±2, . . . is referred to as the exceptional case. Notice that the 
constant a may also vanish in the exceptional case. This occurs, for example, 
when zo is an ordinary point of equation (6-1). 
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We should not conclude from these results, however, that the solutions 
of equation (6-1) will always have singularities at the singular points of this 
equation. For example, the equation 

z 2 w" — 2 zw' + 2w = 0 

has an isolated singular point at z = 0. But the general solution to this equation 
is 


w = Aoz + A iz 2 

where A o and A x are arbitrary. And this shows that every solution of this 
equation is analytic at z = 0. 

By changing notation we can rewrite this solution in the form 

w — a<> + ai(z — 1) + (aj — ao) (z — l) 2 

where ao and a\ can now be taken as arbitrary. This is evidently a canonical 
basis at the ordinary point 2 = 1 . Since it is an entire function, it certainly 
exists and satisfies the equation within a circle whose radius is larger than 
the distance between z=l and the nearest singularity (namely, 2 = 0 ) of the 
differential equation. 

If either of the functions fi{z) and fz (z) in equations (6—20) is not analytic 
at zo, then 20 must be an isolated singular point of this function. Hence, f\ ( 2 ) 
and/ 2 (z) can always be expanded in the Laurent series 77 


/i(z)= £&<?>(* -2 „)»' 

n—— 00 

/.(*) = 5 


( 6 - 21 ) 


which converge for 0< |z — 2o| < R. An important special case occurs when 
the functions /i(z) and fi(z) either have poles or are analytic at zo. Then the 
series (6-21) contain at most a finite number of negative powers of 2 — zo and 
can therefore be written in the form 


77 Which may or may not contain negative powers of (2—20). 
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fi— 2 b^{z — zo) n for i=l,2 

n=-rj 


where ri and r 2 are finite integers (they may be negative or zero).. However, 
we can shift the index in the sums by putting k—n+ri and then summing on 
k. Thus, 


/iU) — X bf_ r .{z— zo) k ~ r i= (z — zo)~ r i j>] a ( k °(z — zo) k for i= 1,2 

k=0 1 k = 0 

where we have put = b ( ^_ r , for i— 1, 2. 

Inserting these equations into equations (6-20) shows that in this case the 
canonical basis is of the form 


Wi(z) = (z — zo) Pl jr a^\z — Z(,) k 

k = 0 

W2(z)= (z — zo) p * j? a^ } (z — zo) k + awi(z) In (z — Zo) 

k = 0 j 


( 6 - 22 ) 


where we have put px = 8x — n and p 2 — 8 2 — r 2 . The series converge everywhere 
within the circle \z — zo| < R (including the point zo) and they, therefore, repre- 
sent analytic functions within this circle. The solutions wi and u >2 are said to 
be regular. It follows from the fact that ri and r 2 are integers that a vanishes 
whenever pi — p 2 ^ 0, ± 1, ±2, . . . since it vanishes when 8x — 8 2 has this 
property. 

We have already seen that equation (6-1) will always possess two regular 
solutions (with px = p 2 = a = 0) at the point zo whenever zo is an ordinary point 
of this equation (i.e., if p(z) and q(z ) are analytic at zo). We shall now deter- 
mine certain conditions which the coefficients of equation (6-1) must satisfy 
at an isolated singular point zo if this equation is to possess two regular solutions 
about zo. 

Since wi(z) and w 2 (z) are linearly independent in 0 < |z — zo| <R, their 
Wronskian (see section 5.9.3) 


W{z) = 


W\ (z) w 2 (z) 

Wj(z) w'(z) 


(6-23) 
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cannot vanish at any point z of this domain. On the other hand, putting 

oo 

gi = 2 a^(z — zo) k for i— 1, 2 in equations (6-22) and substituting the 

k=0 

result into equation (6-23) shows that 


W(z) = (z-z 0 )' , * + ' > *- , G,(z) + a(z-zo) 2p ‘- 1 C 2 (z) (6-24) 

where we have set G x = (p 2 - p x )g x g 2 + (z-z 0 > (g x g 2 - g[g 2 ) and G 2 = g\. 
Since g\ and g 2 are single-valued analytic functions in the entire circle 
\z — zo\ < R (including z 0 ), it is clear that G x and G 2 are also functions of this 
type. And since a — 0 whenever the exponents pi + p 2 — 1 and 2pi — 1 in equa- 
tion (6-24) do not differ from one another by an integer, this equation can 
also be written as 


W{z)= (z-zo) k G 0 (z) 


where X is the smaller of the two numbers pi + p 2 — 1 and 2p x — 1 and Goto 
is a single- valued analytic function in the entire circle |z — zo\ < R. Hence, 
Go(z) can be represented by a power series 


Go(z) = An(z — Zo)" 

n^O 

whose radius of convergence is R. But it follows from the fact that %^(z) is 
not equal to zero in 0 < |z — z<>| < R that there must be a smallest integer, 
say m, such that A m ^ 0. Hence, 

r-(z) = (z — zo) x+m G(z) (6-25) 


where 


G{z) = ^ A k+m (z — zo)* = (z — z 0 ) _m Go(z) 

A=0 

is also a single- valued analytic function in |z — zo| < R but not equal to zero 
at zo. And since W(z) does not vanish at any point of the punctured region 
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0 < \z — zo\ < R, we conclude from equation (6-25) that G(z) is not equal to 
zero anywhere within the circle \z — zo\ < R (including the point zo). Hence, 
differentiating (6—25) shows that 


i dsr 

W dz 


d}nW 

dz 


h±ji _ p (z) 

Z Zq 


where 


G'(z 

G(z) 


is a single- valued analytic function in \z — zo| < R since its denominator does 
not vanish in this domain. But equation (5—26) shows that the coefficient 
p(z) of w' in equation (6-1) is related to the Wronskian by 


P(z) 


1 <W 
W dz 


Thus, if equation (6-1) possesses two regular solutions, its coefficient p(z) 
must be of the form 


p(z)=-± ± £ + P(z) (6-26) 

Z Zo 

Similarly, it can be shown by substituting the first equation (6-22) and 
equation (6—26) into equation (6-1) that this equation will possess two regular 
solutions at zo only if the coefficient q (z) is of the form 


<7 00 = 


(z — Zo) 2 + 



+ <?(*) 


where a and /3 are constants and Q(z) is analytic at zo. We have therefore 
shown that equation (6-1) can possess two regular solutions at the point zo 
only if its coefficient p(z) has, at most, a simple pole and its coefficient q(z) has 
at most a pole of order 2. We therefore say that a singular point z 0 of equation 
(6—1) is a regular singular point if 
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(1) p ( z ) is either analytic at zo or has a simple pole at zo 

(2) q{z ) is either analytic at z 0 , has a simple pole at zo, or else has a pole 
of order 2 at Zo. 

Notice, however, that we have not yet shown that equation (6-1) always 
possesses two regular solutions at any regular singular point. 

An isolated singular point of equation (6—1) which is not a regular singular 
point is called an irregular singular point. 

In order to treat the point z=°°, we must, as in section 5.3, introduce the 
new independent variable £=l/z into equation (6-1). Then the point z=°o 
will be an ordinary point, a regular singular point, or an irregular singular point 
if the point { = 0 is, respectively, an ordinary point, a regular singular point, or 
an irregular singular point of the transformed equation. But the change of 
variable £= 1/z transforms equation (6-1) into the equation 

d 2 w , n / „\ dw . 

-^+Po(0-fi+Qo(t)w=0 (6-27) 

where 

Po(Q ) = 2z — z 2 p(z) (6-28) 

^°(^"M!) = Z49 ^ (6-29) 

Thus z=°° is an ordinary point of equation (6-1) if both /%(£) and Qo(0 
are analytic at £ = 0, which is equivalent to saying that 2 z— z 2 p(z) and z 4 q(z ) 
are analytic at z = °°. Similarly, the point z = 00 is a regular singular point of 
equation (6-1) if P o(£) has, at most, a simple pole at £=0 and (?o(£) has, at 
most, a pole of order 2 at £=0, which is equivalent to saying that 2z — z 2 p(z) 
has, at most, a simple pole at z=°o and z 4 q(z) has, at most, a pole of order 2 at 
z= oo. If the point z = oo is an isolated singular point of equation (6-1) but is not 
a regular singular point, it is an irregular singular point. It is clear that the 
coefficients p(z) and q(z) themselves must certainly be analytic at z = oo when- 
ever this point is a regular singular point. 

These definitions are best clarified by considering an example. Thus, the 
differential equation 

z 2 (z— l)w" — (z + l)w' +zw=0 
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can be written as 


w 


J£±1L^+_L_ 

Z 2 U-1) Z(z-l) 


w 


= 0 


Hence, 


pU) 


U+ 1) 

z 2 (z — 1) 


q{z) = 


l 

z(z-l) 


And the function p has a simple pole at z— 1 and a pole of order 2 at the point 
z= 0. The point z = 0 must, therefore, be an irregular singular point. The func- 
tion q has simple poles at z = 0 and z=l. Hence, the point z=l is a regular 
singular point. All other finite points are ordinary points. In order to consider 
the point at infinity we must consider the functions 2z z 2 p(z) and z q{z) 
(instead of the coefficients p and q) which, in this case, become 


2z— z 2 p = 2z + 


z+ 1 

z 1 


z 4 q = 


z— 1 


Hence, 2z — z 2 p has a simple pole and z 4 q has a pole of order 2. And this shows 
that the point at 00 is a regular singular point. 

We shall now show by actually giving a procedure for finding the solutions 
that the differential equation (6-1) always possesses two regular solutions at a 
regular singular point. 


6.3 SOLUTION OF EQUATION ABOUT ORDINARY POINTS AND REGULAR 
SINGULAR POINTS 

Let zo be either an ordinary point or a regular singular point of the differ 
ential equation 


w"+p(z)w' + q(z)w = 0 (6-30) 

Since p has, at most, a simple pole and q has, at most, a pole of order 2 at z 0 , 
the Laurent series expansion for p about zo contains, at most, one negative 
power of z — zo and that for q contains, at most, two. Let Ri and R 2 be the 
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distances between zo and the nearest singular points of p(z) and q(z), re- 
spectively, to zo. Then Ri and R 2 will be strictly positive and 

p(z) = j? Pk(z — zo) A_1 for 0 < \z— z 0 \ <Ri (6-31) 

k= 0 

q(z) = j? qk(z — zo ) k ~ 2 for 0 < \z — z 0 \ < R 2 (6-32) 

fc =0 

The point z 0 will be an ordinary point of the differential equation if, and only if, 

Po = qo = qi = 0 (6-33) 


6.3.L First Regular Solution 

We shall now show that equation (6—30) always has at least one solution 
of the form 



(6-34) 


in some punctured circular region about the point Zo. This will be accomplished 
by first showing that the constants p and a n can always be determined 
in such a way that equation (6—34) formally satisfies the differential equation. 
It will then be shown, a posteriori, that the formal operations were indeed 
justified and therefore that the formal solution is, in fact, a true solution of 
the differential equation. The procedure which we develop by this process can 
then always be used to construct a solution to any given differential equation: 
of the form (6-30) in the neighborhood of any of its ordinary points or regular 
singular points. 

Since the assumed expansion (6—34) must certainly have a leading term 
and since the exponent p is not, as yet, specified, we can always assume 
that matters are arranged so that a 0 ^ 0. We now substitute this expansion into 
the differential equation (6-30) and suppose that the series can be differen- 
tiated term by term. Then 
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2) {n + p){n + p—\)a n (z — zo) n+p ~ 2 

n=0 

+ [2 Pk ( z ~ 2 °) fc_1 X ( n + p)a n (z — z 0 ) n+p_1 

*-fc=0 -I Ln=o 

+ jr 9Ar(z — Z 0 ) fc_2 l [§ «n(z — 2 0 )" +P =0 

-fc=0 J Ln=0 

And after forming the Cauchy products of the series and adding the resulting 
series term by term, we get 

2 ( z — z 0 ) n+p - 2 \{n + p) (n 4- p — l)a„ + V a* [ (k + p)p n -k + <7n-k]| = 0 
»= 0 1 k=o J 

(6-35) 

Now it follows from the uniqueness property of power series that this 
series will vanish for all values of z in some domain only if the coefficients 
of each power of 2 20 vanish individually. Upon equating these coefficients to 

zero, we obtain for n — 0 

F(p)a 0 = 0 (6-36) 

and 

71-1 

F(n + p)a n = — ^ a k [(k + p)p n -k + q n -k ] for n = 1, 2, . . . 

k= 0 

(6-37) 

where we have put 

F(p) = p 2 + (po — l)p + <7o (6—38) 

Since, by hypothesis, do # 0, equation (6-36) implies 
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F(p ) = p 2 + (p o — 1 )p + qo — 0 (6-39) 

This quadratic equation is called the indicial equation. Its two roots are 
called the characteristic exponents of the differential equation at the point 
Zo. Equation (6-37) is called the recurrence relation. 

If the constant p in the assumed solution (6-34) is chosen to be a root of 
the indicial equation (6-38), the coefficient of ( z — zoY~ 2 in equation (6-35) 
will vanish for arbitrary values of the constant ao. And if there is no positive 
integer n for which F(n + p) =0, equation (6-37) can be used to calculate 
successively (starting with ao) the coefficients 78 a n in such a way that the 
coefficients of all the remaining powers of z — zo in equation (6-35) will vanish. 
Thus, when the constants p and a n for n = 1, 2, . . . are determined in this 
manner, the series (6-34) will at least formally satisfy the differential equation 
(6-30) with the constant ao arbitrary. 

Of course, if F(n + p) vanishes for some positive integer n, we have no 
assurance that equation (6-37) can be used to calculate a„. But let the roots of 
the indicial equation (i.e., the characteristic exponents) be denoted by pi 
and p 2 with the notation chosen so that f/L p\ ^ 0k P 2 , and let 

V = pi — p 2 (6-40) 


Then 


9L v 2* 0 (6-41) 

And since a quadratic function can always be expressed as the product of its 
factors, we can write 


F{p) = (p-pi)(p-p 2 ) 


Hence, 


F(n + pi) = n(n+ v) 


(6-42) 


78 Notice that for each integer n the recurrence relation (6-37) expresses a n only in terms of the coefficients 
ate with 0 k < n. Hence, it can be used to first determine ai in terms of ao and then to determine a 2 in terms of ai 
and a 0 . But since a t is known in terms of a 0 , this determines a 2 in terms of ao. By proceeding in this manner, each a» 
can be determined in succession in terms of ao. 
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Then, since equation (6-41) shows that 


^ (n + v) = n + .&* n 


(6-43) 


it follows from equation (6-42) that 

F(n + pi)#0 for n= 1,2, . . . (6-44) 

Hence, when p = pi, the coefficients a n for n— 1,2, . . . can all be calculated 
recursively in terms of ao from the recurrence relation (6-37). Thus, equation 
(6—30) will always possess at least one formal solution of the form (6—34) about 
the point z<>. 

It is still necessary to verify that this formal solution is justified. In order to 

oo 

do this, we must show that the series ^ a„(z — z 0 ) n obtained by the procedure 

71 = 0 

described actually converges. We can then conclude from the theory of power 
series given in section 5.5 that the formal operations of (1) differentiating the 
series (6-34) term by term, (2) forming the Cauchy product of the resulting 
convergent series with the convergent series (6—31) and (6—32), and (3) adding 
the resulting convergent series term by term are justified 79 and, hence, that 
equation (6—34) is indeed a solution to the differential equation (6—30). 

In order to establish the convergence of the series in equation (6-34) for 
the case where p — pi, notice that in view of equations (6-42) and (6-44) the 
recurrence relation (6—37), with p — pi, can be put in the form 

71-1 

^ CLk [ {k pi)pn~k Qn-k\ 

— — for i. = 1,2,... (6-45) 


79 We shall subsequently encounter (in ch. 9) a case where the constructed series actually diverges and hence 
the formal procedures are not necessarily justified. 
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Therefore, 




n - 1 

^ &k [ ( A: H“ pi)pn-k~^~ Qn-k\ 
k — 0 


n(n+v) 


n- 1 

2 I a k | [ | Pn-k | | Pi | + I Qn-k | + k I Pn-k \ ] 
= 0 

n | n + v 


(6-46) 


Let f? = min {R i, /? 2 }, where /?i and R 2 are the radii of convergence of 
the series (6-31) and (6-32) for p and q, respectively. Then/? is strictly positive 
(since R 1 and R 2 are positive) and the Cauchy estimates, given in section 5.5, 
for the derivatives of the analytic functions p and q imply that there exist finite 
positive constants M and N such that 



and 



for k = 0, 1, 2, . . . 


which implies that 


Pk I I Pi | + I qk 


M pi + TV 

Rk 


for A: 2 s 1 


(6-47) 


And since \n+v\ 5 3&(n + v), it follows from equation (6—^ 43) that |n+ v\ 2* n 
for n=l, 2, . . .. Inserting these results into equation (6—46) shows that 



for n= 1,2, . . . 


(6-48) 


Now put P = M\pi\ +N+M+ 1. Then P is a finite number which is larger 
than 1; and for n 2= 1 and k< n, 
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Af pi +N+kM < A/ pi -I- /V+ A/ < p 
n n 


Hence, it follows from equation (6-48) that 


«n - 


I a* I 


/! — 1 
/c=0 


for n= 1, 2, . . . 


(6-49) 


which for n = 1 becomes 


|a,| =£ 



do 


(6-50) 


Suppose that for n > 1 



for 1 =£ k < n 


(6-51) 


We shall now show that this implies that the inequality also holds when 
k = n. Since n can be any integer larger than 1 and since equation (6—50) 
shows that the inequality also holds when n=l, we can then conclude by 
induction that the inequality will hold for every positive integer n. Thus, 
inserting the inequality (6-51) into equation (6-49) shows that 


p h— i pk 

W « £ 2 F |ao1 

k = 0 


(6-52) 


But since P ^ 1, it follows that P l: ^ P n *. And when this is inserted into 
equation (6-52), we find that 



(6-53) 


Hence, we can conclude by induction that this inequality holds for every 
positive integer n. 

We can now use this inequality to show that the series V a n (z — z 0 ) n , 
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which appears in the solution (6-34), converges at least within the circular 
region (of nonzero radius) 


Z — Zo 


R 

2P 


(6-54) 


To this end, notice that within this circle the inequality (6-53) shows that the 
nth term of this series has the property that 


\a n {z-z 0 ) n \=\a„\ |z-z 0 |" « ^ |a 0 | (Jpj = |a 0 | ^ 


(6-55) 


And since the series 


a 0 


oo i 


n-0 


is simply a geometric series, it certainly converges. Hence, in view of the in- 
equality (6-55), a simple application of the comparison test 80 shows that the 

00 

series ^ a„(z — z o)" is absolutely and uniformly convergent at least within 

the circle (6-54). It, therefore, represents an (single valued) analytic function 
within this circle. 

Thus, we have shown that 


Wi(z) = (z — Zo) Pl 2 — z °)” (6-56) 

n=0 


is a solution to equation (6—30) within some circle about zo for arbitrary atf* 
provided the coefficients c$ for n = 1, 2, . . . are computed from the recurrence 
relation (6-45). But the series in (6-56) converges within a circle about zo 
which passes through the singular point of w j (z) nearest tozo. In fact, the radius 
of convergence of this series must be at least as large as the distance/? between 
zo and the nearest singular point of equation (6-30). In order to prove this, 
notice that R\ and /? 2 , the radii of convergence of the series for p and q, re- 
spectively, must be equal to the distance between zo and the nearest singular 


80 See ref. 23. 
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points of these functions- But it follows from the definition of a singular point 
of a differential equation that R = min {R u R 2 }. Hence, p and q are single 
valued and analytic within the punctured circular region A defined by 
0 < \z — zo| < R. Now since wi is a solution to the differential equation, it 
must be possible (as shown in section 5.9.2) to analytically continue this 
function along any simple curve in A. And, therefore, wi cannot have any 
singular points within A. But it certainly has a singular point on its circle of 
convergence. Hence, its radius of convergence cannot be smaller than R. 
And this proves the assertion. 

Since the analytic continuation of a solution of the differential equation 
(6—30) is also a solution of this equation, we can now conclude that th e function 
Wi(z ) given by equation (6~56) converges and satisfies the differential equation 
(6-30) at every point of the domain 0 < \z — Zo\ < R, where R is the distance 
between zo and the nearest singular point of equation (6— 30). 

We have now shown that equation (6—30) possesses a regular solution 
(6-56) about any ordinary point or regular singular point of this equation. 

Before obtaining a second regular solution we shall introduce an example 
to illustrate the procedure described in the preceding paragraphs. Thus, 
consider the equation 


3 z 2 w" + zw' — (1 + z)w = 0 


(6-57) 


Since the coefficients p and q are 

1 


P(z) = 


3z 


1+2 

3z 2 


and since the coefficients (6-28) and (6-29) of the transformed equation are 

5z 


2z — z 2 p = 


*> 


we see that every point is an ordinary point except the point z = 0, which is a 
regular singular point, and the point z = °°, which is an irregular singular point. 

We know that if we seek a solution about the regular singular point 
z = 0 of the form 
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W = ^ dnZ n+p (6—58) 

n=0 

we will obtain at least one and perhaps two solutions to the differential equation 
in which the constant ao is arbitrary. And since the nearest singular point of 
the equation to z — 0 is at °°, we know that the series will converge in the 
entire plane. 

Instead of using the general formulas obtained above it is usually easier 
in any given case to derive the solution from first principles by substituting 
the assumed power series into the differential equation. Thus, after substituting 
equation (6-58) into equation (6-57) and differentiating term by term, we get 

^ [3(n + p) 2 — 2 (n + p) — 1 ]a„z” +p — ^ a„z n+p+I = 0 

71=0 71=0 

In order to collect the coefficients of like powers of z, it is convenient to 
reindex the sums so that the same powers of z appear in each term. Thus, in 
the second sum, we replace the dummy index n by the index k = n + 1 and in 
the first sum we put n — k. And since k = l when n = 0, in the second sum we 
obtain 

2) [3 (A: + p) 2 — 2(A: + p) — 1 ] a k z k+p — a k -iz k+p = 0 

k = 0 k = 1 

Now equating to zero the coefficients of like powers of z gives 

[3p 2 — 2p — 1 ] ao = 0 for k = 0 

and 

[3(&+p) 2 — 2(A;+p) — l]afc=afc-i forA == l,2, . . . (6—59) 

Since ao ^ 0, the indicial equation is 

F(p) = 3p 2 -2p-l 
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And its roots are 

p x =l and p 2 =l/3 (6-60) 

When p= 1, the recurrence relation (6-59) can be solved for all values of k 
and can be written as 

afc= &(3£ + 4) for k= 1, 2, . . . 
or writing out the first few terms 

_ a 0 

ai “ 1 x 7 

_ a l 

° 2 — 2 x 10 


&n - 1 

an n(3n-\-4>) 

In order to determine a„ in terms of ao we multiply together both members of 
these equations to obtain 


1 

a ‘ ' fl2 ' • ' a ”“ (! • 2 . . . n) [ 7 • !0 . . . (3n + 4)] a ° ' a ‘ 


&n-\ 


and then divide through by ai • a 2 . . . a n - 1 to get 


&n 


ao 

n![ 7 • 10 . . . (3 n + 4) ] 


for n = 1, 2, . . . 


(6-61) 


Notice that the term in square brackets is a product whose factors are 
elements of an arithmetic progression. The factors in this progression in- 
crease by 3 in each term. Hence, if we factor out a 3 from each term we obtain 
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7 • 10 • 13 . . . (3. + 4) = 3” (|) (| + l) (| + 2) . . . (| + . - l) - 3» Q. 

where we have the generalized factorial function defined in equation (5-33). 
Upon substituting this into equation (6-61) and then substituting the result 
together with p = 1 into equation (6-58), we obtain the regular solution 


m = A 0 ^ 

n-0 


l 




(6-62) 


where we have put A o = 3a©. 

6.3.2 Second Regular Solution 

We shall now use the regular solution w\ ( z ) obtained in section 6.3.1 to show 
that equation (6-30) always possesses a second regular solution about any 
ordinary point or regular singular point z o. In order to do this we use the results 
obtained in section 5.9.3, which show that 

, . / x f dz n >U)dz 

Wi{z) — w\ (z) J |" M;i ^ z )j 2 e ' (6-63) 

is a solution of equation (6-30) which is linearly independent of Wi(z). We have 
omitted the arbitrary constant c in equation (5-28) since we may assume that 
it has been absorbed into the arbitrary constant a 0 which multiplies wi (z) . 

Now upon integrating equation (6-31) along any path in the punctured 
circular region 0 < jz — Zo| < Ri, we obtain 



= p 0 ln(z — z 0 ) +c 0 + go{z) 


(6-64) 


where 


go{z) = J Jy (z-z 0 ) fr 
and Co is an arbitrary constant. 


(6-65) 
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Since the series ^ pk(z — zo) k converges, the series (6-65) must also con- 

k= I 

verge (section 5.6). Hence, ga{z) is an analytic function in a circle centered at 
zo. It, therefore, follows from equation (6-56) that 

r e ; (z-z 0 )- 2p 'e-p « ln ( ‘-*« ] g{z) 

[u>i(z)J 2 


= (z- z 0 )- (2pl+Po) g(z) (6-66) 


where we have put 

p z) oo 

g(z) = e~ c ° - and gi(z) = £ a^(z~z 0 ) n 

Oi(z )] 2 «=o 

Since a), 1 * # 0, there is some neighborhood of the point zo in which g x (z) 
is never equal to zero And since g\ (z) is analytic at zo, it follows that the func- 
tion [gi(z)] -2 is also analytic and not equal to zero 81 at zo. Similarly, it follows 
from the fact that go(z) is analytic at zo that exp [— go(z)] is analytic and non- 
zero 81 at zo. Hence, the function g(z) is analytic and not equal to zero at 
zo. It can, therefore, be expanded in the power series 


g(z) = V (z — Zo) ” with a 0 # 0 

n - 0 


(6-67) 


which converges in a circle of finite radius about zo. 

Now since the two roots p x and p 2 of the quadratic indicial equation (6-39) 
are given by 

— (po— 1)± V(po — l) 2 — 4go 

2 

it follows that pi + p 2 = l— po- And therefore, in view of definition (6-40), 

2pi + po=i'+l (6-68) 


81 Notice that [#i(z )]~ 2 and exp [— £ 0 (z)] can only equal zero at points where gi(z) and go(z), respectively, 
become infinite; i.e., at singular points of these functions. 
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When equations (6-67) and (6-68) are substituted in equation (6-66) 
and the resulting expression is substituted into the integrand of equation 
(6-63), we find that 


f i 

w 2 (z) = W\(z) J ^Oin(z-zo) n dz 

And it follows from the fact that the power series converges that we can 
interchange the order of summation and integration (section 5.6) to obtain 


w 2 (z) =< 


~ n n — v 

n — 0 


(z—zo) n ~ v for y 0, 1, 2, . 
(z — zo) n ~ v + Wi(z)a„\n (z — z 0 ) 


(6-69) 

for v = 0, 1,2, . . . 


Since these series converge, they represent analytic functions within 
their circle of convergence. Therefore, the product of either of these series 
with the series in equation (6-56) is again an analytic function which can be 

expanded in a power series ^ a ( f (z — zo) n with a nonzero radius of converg- 

71=0 

ence. Hence, in view of definition (6-40), it follows from equation (6-56) that 
equation (6-69) can be put in the form 


oo 


w 2 (z) = (z — zo) p2 V a < g ) (z — z 0 ) n +aw 1 (z) In (z — zo) 

n=0 


(6-70) 


where a=0for v ^ 0, 1, 2, . . . 

By repeating the argument used in section 6.3.1 to determine the size of 
the circle of convergence of the solution (6-56), we can again show that 
the function w 2 (z) given by equation (6-70) converges and satisfies the differ- 
ential equation (6-30) at every point of the domain 0 < |z — zo| < R, where R 
is the distance between zo and the nearest singular point of equation (6-30). 

We have now proved that the linear second-order differential equation 
(6-30) possesses two linearly independent regular solutions at every ordinary 
point and at every regular singular point. 

In order to show how to construct the solution (6-70), it is necessary 
to consider certain cases separately. 
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6.3.2. 1 Case (i): v # 0, 1, 2, . . . — First, consider the case where the 
difference in the characteristic exponents v is not equal to an integer. Then 
a = 0 in equation (6-70), and this solution is of the same form as the trial 
solution (6-34) with p = p 2 . Hence, if equation (6-70) is substituted into the 
differential equation (6-30), we will find, as before (since p 2 already satisfies 
the indicial equation), that the coefficient a\ ^ must satisfy the recurrence rela- 
tion (6-37) with p = p 2 . But since equation (6-40) and the equation following 
equation (6-41) show that F(n + p 2 ) = n(n — v), the recurrence relation (6-37) 
with p = p -2 can be written in the form 


n(n — v)a n = ~'£aic[(k + p 2 )pn-k + Qn-k] forn 1,2, . . . (6-71) 

o 


And since v is not equal to an integer, the coefficient of a n never vanishes. 
Hence, this equation can be solved recursively to determine the coefficients 
aty in exactly the same way that the coefficients in the first solution (6-56) 
were determined. Nothing new is involved. In fact, in solving the recurrence 
relation in this case it is usually easier to leave p unspecified as long as possible 
and to determine the coefficients of the first and second solutions 
simultaneously. 

For example, the left member of the recurrence relation (6-59) of the 
differential equation (6-57) considered in section 6.3.1 can be factored to 
obtain 


CLk — 1 

afc= (p + *-l) (3p + 3£ + l) 


forA;=l,2, . . . 


or, writing out the first few terms, 
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_ do 

ai_ p(3p + 4) 


Oi 

(p+1) (3p + 7) 


&n - 1 

+ 1) (3p + 3n + 1) 

And upon multiplying together both members of these equations we obtain 


CL\ CL% U3 • • • 


&0 * * fl 2 • • • — 1 

[p(p+l) . . . (p + n — 1)] [(3p + 4) (3p + 7) . . . (3p + 3n+l)] 


Hence, 


flo 

a " [p(p+l) . . . (p + n — 1)] [(3p + 4) (3p + 7) . . . (3p + 3n + 1 ) ] 

The characteristic exponents pi and P 2 are given by equation (6-60). If we 
set p = pi = l in this equation, we obtain the coefficients of the first solution. 
But if we set p = p 2 , we obtain' the coefficients 
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do 



do 

”[(-1) -2 -5 . . . (3n — 4)]n! 

of the second solution. After using the generalized factorial function (eq. 
(5-27)) to get 

(-1) 2 5. .<3.-4>-3-(-|)(-j-+l)(-y+2)- ■ -(-y+.-l) 

-H). 

and then substituting the results with p=l/3 into equation (6-58), we find 
that the second solution to equation (6-57) is 



where we have put Bo — 6o3 _1/3 . 

6.3.2.2 Case ( ii ): v=l, 2, .. . and a = 0. — In this case the second 
solution (6-70) will also be of the same form as the trial solution (6-34) 
with p=p 2 . Hence, its coefficients a ( „ 2) for n= 1, 2, . . . must again be deter- 
mined by the recurrence relation (6-71) with p = p 2 . But since v is a positive 
integer, the left-hand side of this equation will vanish when n=v. The recur- 
rence relation for a P can therefore not be satisfied unless the right-hand side 
of this equation 

V—l 

^ ak[(k + P2) p P -k + Qv-k] 
k = 0 

also vanishes, in which case it will be satisfied automatically for all values 
of a„. We then obtain a solution involving two arbitrary constants. And since 
this solution must be a general solution to the differential equation, it will 
contain the solution obtained for p = pi. 
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This situation will always occur in the important special case when z o 
is an ordinary point of the differential equation. For in this case, equations 
(6-33) and (6-39) . show that the indicial equation is 


F(p)-p(p- 1)=0 


Hence, the characteristic exponents are p\ — 1 and p 2 = 0; and their difference 
v = pi — p 2 = 1 is a positive integer. 82 But it follows from equation (6-33) 
that the recurrence relation (6-71) for ai becomes 

1 X 0 X a\ = ao [(OX p x ) + 0] 

and this is automatically satisfied for any choice of a x . The recurrence relation 
(6-71) with n = 2, 3, . . . will then uniquely determine the remaining a n - 
We therefore obtain in this case a solution which involves two arbitrary 
constants. 

For example, consider the equation 

(z 2 — 1) w"-\- 6zw' + \w = 0 (6—72) 

Since its coefficients p and q are 

we see that the only singular points of this equation are regular singular points 
atz=l,z = — 1, andz = o°. 

We know that if we seek a solution about the ordinary point z = 0 of the 

form 

w= anZ n (6-73) 

n=0 


we will obtain a general solution to the differential equation in which the 
coefficients do and ai are arbitrary constants. And the series will converge at 
least within the circle |z|< 1. 


82 That v must be an integer in this case could easily be anticipated from the fact that the characteristic exponents 
themselves must be integers if the solutions are to be analytic at z — zo. 
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Thus, after substituting the expansion (6-73) into equation (6-72), differ- 
entiating term by term, collecting terms, and factoring the coefficient of z n , 
we obtain 

— n{n— l)a„z n ~ 2 + (n + 4) (ra+ l)a„z" = 0 

n=0 n=0 

which becomes, upon shifting the index in the second sum, 

— ^ k(k — l)a k z k ~ 2 + ^ (A: + 2) (A: — l)afc_ 2 Z fc_2 = 0 

fc =0 fc =2 

We now equate to zero the coefficients of like powers of z to get 

Ox (— 1 ) a 0 = 0 1 X 0 X «i = 0 

and 


— k(k — l)afc+ (A: + 2) (k — \)a k -<i — 0 for& — 2,3, . . . 

The first two equations merely serve to show that a 0 and a t are arbitrary, as 
we already know. And since the coefficient of a* is not zero for k 2* 2, we can 
write the remaining equations as 

ak = -h±la k - 2 for k = 2, 3, . . . (6-74) 

k 

This equation shows that each succeeding a k is determined from the aj 
whose subscript is 2 lower than its own. Thus, this recurrence relation will 
ultimately determine the a k with even values of k in terms of ao and the a k with 
odd values of k in terms of a\. It is, therefore, convenient to consider separately 
the equations for even and odd values of k. First, upon writing out these 
equations for even values of k beginning with a 2 , we find that 
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d'ij : 


20 + 1 ) 
2 




Similarly, for odd values of k beginning with a 3 , we find that 


5 

#3 — — g 0,1 

7 

#5 ~ — g #3 


_ 2y+ 3 

a2j+1 27+1 


a 2j-i 


Multiplying together both sides of the equations for even values of k and 
dividing through by a 2 • a 4 • a 6 . . . a 2 j -2 gives 

_ [4-6-8 . . . 20+1)] 
a2j 2 • 4 • 6 ... 2j a ° 

which becomes, upon removing the common factors of the numerator and 
denominator, 
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« 2 /= (;'+ l)a 0 for j= 1,2, . . . (6-75) 

And upon proceeding the same way for the odd values of k, we obtain 

2;'+3 

« 2 j+i = — 2 — ai for j= 1,2, . . . (6-76) 

Since equations (6-75) and (6-76) show that the coefficients of even and 
odd powers of z are given by different expressions, it is convenient to first re- 
arrange the series (6-73) into two series, one containing the even powers ofz 
and the others containing the odd powers. This is legitimate since every 
rearrangement of a convergent powers series converges to the same sum. Thus, 
equation (6-73) becomes 


W= 2 a ? i z2J+ 2 a 2j+l2 2j+1 

j=0 j = 0 

And upon substituting equations (6-75) and (6-76) into this expression, we 
obtain the general solution 


oo oo / 2 I 3 \ 

w = a 0 2 (j+ l)z 2j ' + a\ ( — - — Jz 2 j +l 
j = o ' 

to equation (6—72). It is easy to see that both series converge in the circle 
| z | < 1 and diverge outside this circle. 

In both the examples given thus far in this section, explicit expressions 
for the general terms in the series were obtained. Also the recurrence relations 
in both examples (see eqs. (6-59) and (6-74)) involved only two different 
coefficients; whereas, in general, the recurrence relation will involves different 
coefficients (see eq. (6-37)). It is, in fact, true in general that there is little hope 
of obtaining an explicit expression for the solution unless a two-term recurrence 
relation is obtained. This is not necessarily a limitation on the method because 
any recurrence relation can be used to successively calculate numerically as 
many terms of the series as desired. In fact, it is in the cases where many- 
term recurrence relations occur that the general convergence theorems are 
particularly useful since, without having an explicit expression for the general 
term, it is not possible to tell from an infinite series itself whether or not it is 
convergent. 
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6 . 3 . 2.3 Case (iii): v = 0, 1 , 2, . . . and a#0. — In this case the loga- 
rithmic term will be present. Notice that in case (ii) we did not include v=0. 
The reason for this is that when *>=0 the characteristic exponents are equal 
and therefore there is only a single recurrence relation. Hence, there; can only 
be one regular solution of the type (6-34) and the logarithmic term must be 
present in the regular solution (6-70). When v = 1 , 2, . . ., a will not be equal 
to zero if, and only if, the recurrence relation (6-71) cannot be solved for a„. In 
this case, no generality will be lost if we set 83 a — 1. The solution can always be 
determined by substituting equation (6-70) into the differential equation, using 
the fact that w\ is a solution to simplify the result, and then setting to zero the 
coefficients of the various powers 84 of z — z<> to calculate the coefficients 
a ( „ 2) recursively. 

The procedure is best illustrated by considering an example. Thus, the 
differential equation 


zw" — zw' — w = 0 (6-77) 

has a regular singular point at z = 0 and an irregular singular point at z = ®. 
We know that this equation will possess at least one solution of the form 


w= ^ a n z n+p 

n= 0 


(6-78) 


about the point z = 0 and that this solution will converge in the entire complex 
plane. Upon substituting the expansion (6-78) into equation (6-77), inter- 
changing the order of summation and differentiation, shifting the indices, and 
collecting terms, we obtain 


S (n + p)(n + p — 1 )a n z n+p ~ 1 — ^ (n + p)a n -iz n+p ~ 1 = 0 

n= 1 


Equating the coefficients of like powers of z to zero gives for n = 0, since ao 
is arbitrary, 


pip - 1) = 0 


(6-79) 


83 Since w\ is detemined only to within a constant factor. 

84 The logarithmic terms will always cancel out. 
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and 


(n + p — l)a„ = a n -i for n = 1, 2, . . . (6-80) 


The roots of the indicial equation are pi = 1 and p 2 = 0; hence, the difference 
of the roots v = pi — p 2 = 1 is an integer. First, consider the case where 
p = pi = 1. The recurrence relation is (see eq. (6-76)) 

&n = o.n -1 for n= 1,2,... (6-81) 

After writing out the various terms of equation (6-81), multiplying the cor- 
responding members of these terms together, and dividing out common factors, 
we get 

1 1 

dn loo ^0 t do 

1 • 2 • 3 . . . n nl 
Hence, we find that the solution for p = 1 is 


wi = a 0 ^ A z tl+1 (6-82) 

n = 0 n ' 

which can easily be summed to obtain 

w i = doze* (6-83) 

Next, consider the case where p = p 2 = 0. The recurrence relation is 
( n — 1 )a n = a n -i- Since ao ^ 0, it is clear that this equation cannot be solved 
for ai. Hence, the differential equation must possess a logarithmic solution. 
We, therefore, seek a second solution of the form 


<x> 


W 2 = ^ dnZ ' 1 + w 1 (z) In z 

n = 0 


(6-84) 


Upon substituting this expansion into equation (6-77) and recalling that w x 
satisfies the differential equation, we find that the resulting coefficient of In z 
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vanishes. Then shifting the index and collecting terms in the summation gives 
f [n(/t — l)a„ — na„_i]z w-1 = 0 (6-85) 

\ 2 / n=l 

Hence, after substituting in equation (6-83), we get 


a 0 (l + z)e*+ [n(n — l)a„ — na n -i]z n - l = 0 (6-86) 

n= 1 

00 1 

But replacing e z by its series representation ^ — z" and then shifting the 
indices shows that n=0 


Then upon equating to zero the coefficients of like powers of z, we find that 

a.Q= do for n= 1 


and 


(In — 1 dp 

n — 1 (n — l)(n — 1)! 


for n = 2,3,4, . . . 


(6-87) 


where ao and ai are arbitrary. Actually, we could simplify matters by setting 
ai = 0, but we shall carry d\ through as an arbitrary constant in order to 
demonstrate this. 

Upon writing out the first few terms of equation (6-87), we get 


__ a\ do 
1 “ 1 • 1 ! 

_ dp __ (ll dp A 1 \ 

2 _ 2 • 2!“ 1 • 2 2! v 2/ 
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g 3 ao _ a 1 flo /, 1 1\ 

04 3 3-3! 1-2-3 3! \ 2 3/ 

Hence, we conclude by induction that 

a n = t — ^ 77 7 — t — ^777 H n - 1 for n = 2 , 3-, . . . ( 6 - 88 ) 

{n — 1 ) ! (n — 1 ) ! 

where we have defined H n to be the partial sum of the harmonic series. That is, 

( 0 for n = 0 

► (6-89) 

11 1 n 1 

I+ 2 + 3 + --- + ~„=I,k for„=l,2 J 

k= 1 

Substituting equations (6-83) and ( 6 - 88 ) into equation (6-84) yields 

w 2 = a 0 | (ze* In z) + 1 — j? jji + ai 2 Z " 

which becomes, after shifting the indices in the sums, 

Wz — do j" (ze* In z) + 1 — V — H„z n+1 + a i 2 ~ i z ” +1 

L n - -1 n=o n - 

Notice that the second sum is essentially the solution w\. Hence, the term in 
square brackets must be a solution which is linearly independent of w\. This 
(i.e., the bracketed term) is the solution we would have obtained if we put a\ = 0 . 

6.3.3 Summary 

We have now shown how two linearly independent solutions to the differ- 
ential equation (6—30) with analytic coefficients can be found in the neighbor- 
hood of any regular singular point zo. First the indicial equation (6-39) is 
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determined. If the difference v of the two roots of this equation is not an 
integer (including zero), two solutions of the form (6—34) are obtained, one 
for each root of the indicial equation. However, if v is an integer, the recurrence 
relation (6-37) for the term a v must be investigated with p = p 2 - It will either 
automatically be satisfied for any a v , or it will be impossible to satisfy for any 
a v . If it is automatically satisfied, ao and a v will be arbitrary; and we will 
obtain two linearly independent solutions with p = p 2 . If v=0 (i.e., pi = P 2 ) 
or if v is an integer and it is impossible to satisfy the recurrence relation, the 
solution will have the form (6—70) and can be found by substituting this form 
into the differential equation. The point at infinity is treated by making the 
change in variable 2 = l/£. 

6.3.4 Computation of Indicial Equation 

Since the nature of the solutions of equation (6-30) in the neighborhood 
of a regular singular point 20 is so dependent on the roots of the indicial equa- 
tion (characteristic exponents), it is useful to be able to determine this equa- 
tion without first finding the expansions (6—31) and (6—32) of the coefficients 
about 20 . 

Equation (6-39) shows that this is accomplished once po and <jf« are known. 
But it follows from equations (6-31) and (6-32) that 


oc 

(2-2o)p0z)=Po+ 2 Pk(z~Zo) k 


k= 1 


(2 — z 0 ) 2 q(z) = qo+ ^ q k (z—z 0 ) k 


k=l 


Hence, 

p 0 = lim [( 2 — 2o)p(z)] and qo= lim [( 2 — z 0 ) 2 q(z)] 

Z— *Zq Z * Zo 

Thus, for example, the equation z 2 w" + z(2 + 3z)w' + (1 — 2 ) 1 *;= 0 has a regular 
singular point at 2 = 0 and 


P(z) = 


2 + 3 2 


and 
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Hence, 


Po = lim z (- — — \ = 2 and 

z— »0 \ Z ) 


qo = limz 2 =1 

z-*0 \ Z 2 


and the indicial equation (6-39) is 

p 2 + (po — l)p + <7o = p 2 + p+ 1 = 0 

If the regular singular point zo is the point at infinity, we make the change in 
variable z= l/£ and investigate the point £ = 0. This leads to the relations 


Po 


— lim ( 




1 /I 


y yo P 


*0 L U C 2 \C 



qo = lim £ 2 
«-*o 


1 (l 
c 4 q vc 


And upon changing back to the original variable z, they become 
Po = lim [2 — zp(z)] and g 0 = lim z 2 <7(z) 


Z *°° 
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Riemann-Papperitz Equation and the 
Hypergeometric Equation 


7.1 THE FUCHSIAN EQUATION 

Having shown how to construct solutions to a differential equation with 
analytic coefficients about its regular singular points, it is natural to study 
those equations whose only singularities (in the extended plane) are regular 
singular points. Such equations are called F uchsian equations. We shall restrict 
our attention to Fuchsian equations of the second order; that is, equations of 
the form 


w" + p(z)w' + q{z)w= 0 (7-1) 

This equation can have, at most, a finite number of singular points. In order to 
prove this, notice that since the only singularities of the coefficients p and q in 
the extended plane are poles, these functions must be rational (see section 
5.4). But since the number of singularities of a rational function which occur 
at finite points of the plane is equal to the number of distinct zeros of its poly- 
nomial denominator, it follows that p and q have, at most, a finite number of 
singular points. However, every second-order equation must have at least one 
singular point. For if equation (7-1) had no singularities in the extended plane, 
the coefficients p and q would be everywhere analytic; and, hence, by Liou- 
ville’s theorem, they would be constants. However, even if the constant value of 
p were zero, the coefficient 2 z — z 2 p(z) of the transformed equation (see 
eq. (6—1 28) ) would still have a simple pole at infinity. But this would contradict 
the assumption that equation (7—1) had no singularities. Hence, we conclude 
that this equation must have at least one singularity. 
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Since the rational function p has, at most, simple poles and the rational 
function q has, at most, poles of order 2, it is clear that any Fuchsian eauation 
with not more than m singular points in the finite plane must have coefficients 
of the form 


p(z ) = 7 W 7 V 

(z — Zi)(z — Z 2 ) ■ ■ ■ ( Z — Zm ) 

( v QM 

q Z (z — Zx) 2 (z — Z 2 ) 2 . . . (z — Zm ) 2 

where P and Q are polynomials. 

7 . 1.1 Fuchsian Equations With, at Most, Two Singular Points 

There are two possible forms which a Fuchsian equation with, at most, 
two singular points can have. First, consider the equation which has, at most, 
one singular point in the finite plane (with a possible singular point at infinity). 
Its coefficients must take the form 


p(z) = 


Pjz) 

z — a 


Qiz 

(z-a) 


But if the coefficients of the transformed equation 2z — z 2 p and z 4 q are to have 
at most a simple pole and, at most, a pole of the order of 2, at infinity the 
polynomials P and Q must both be constants, say A and B , respectively. Hence, 
the differential equation must be of the form 


w" + 


z — a 


w 


B 

( z-a)' 


w = 0 


(7-2) 


Next, consider the case where the singular points are both in the finite 
plane. Then the coefficients must be of the form 


p = PiU „= QM 

(z — a)(z — b) ^ (z — a) 2 (z — b) 2 


But the coefficients 2 z — z 2 p and z 4 q of the transformed equation will be analytic 
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at z = 00 only if there are constants C and D such that P — 2z + C and Q = D. 
Hence, in this case the differential equation must be of the form 


w" + 


2z+C 


( z — a) (z — b) 


w' + 


D 


(z — a) 2 (z — ft) 2 


w— 0 


(7-3) 


Thus, a Fuchsian equation which has, at most, two singular points must be 
either of the form (7-2) or of the form (7-3). It is easy to verify that the 
general solution of equation (7-2) is 


fCi (z — a) p i + C 2 (z — a) p 2 if (A — l) 2 — 4B 9^ 0 


w = 


[ (z — a) p i [Ci + C 2 In (z — a) ] 


if 04 -l) 2 - 4fl = 0 


where Ci and C 2 are arbitrary constants and p x and p 2 are the roots of the 
indicial equation p 2 + (A — l)p + B = 0. Since the change in independent 
variable t=\j{z — b) transforms equation (7—3) into an equation of the form 
(7-2), it follows that the general solution to equation (7—3) can also be ex- 
pressed in terms of elementary functions. 

Thus, the solutions of any Fuchsian equation which has no more than 
two singular points in the extended plane are elementary functions. 


7.1.2 Fuchsian Equations With, at Most, Three Singular Points 

In order to obtain a Fuchsian equation whose solution is not elementary, 
we must consider an equation which can have three regular singular points. 
If we require first that these three points, say a, b , and c, all lie in the finite 
plane, the coefficients of the differential equation must have the form 


P ( 2 ) = 


Pjz) 

(z — a)(z — b) (z — c) 


Q(z) 


0(z) 

(z — a Y 2 (z ~ b) 2 (z — c) 2 


where P and Q are polynomials. And since the point z = °° is to be an ordinary 
point, the coefficients of the transformed equation 


2z — 


zJP 

(z — a) (z — b ) (z — c) 


and 


to. 

(z — a) 2 (z — b) 2 (z — c) 2 
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must be analytic at z = oo. But this can occur only if P and Q are quadratic in 
z and the coefficient of z 2 in P is 2. Since the degree of the numerator is less 
than the degree of the denominator, it follows from the elementary theory 
of partial fractions that there exist constants A a , Ab, A c , B a , Bb, and B c such that 

P ( Z ) Ag | Ab | Ac 

(z — a) (z — b) (z — c) z—a z — b z — c 


and 


<?(*) 


B° Bb _|_ Be 


(z — a)(z — b){z — c) z — a z — b z — c 
The differential equation, therefore, takes the form 
d 2 w , ( A a , At 

dz 2 


+ / Ag + Ab + Ac \ dw 
\z — a z — b z — c) dz 

Ba . Bb , Be 


+ 


/ Ba | Bb , Be \ 

\z — a z — b z — c) 


w 


b z — c ) (z — a){z— b)(z — c) 
And since the coefficient of z 2 in P(z) must equal 2, it follows that 


= 0 


(7-4) 


A a A- Ab A- A c — 2 


(7-5) 


Otherwise the constants A a , Ab, A c , B a , Bb, and B c are arbitrary. 

Now at each of the points a, b, and c there is a pair of characteristic 
exponents, say (a', a"), (/3\ /3"), and (y\ y"), 85 respectively. In order to find 
a relationship between these exponents and the A’s and B’ s, notice that since 


po = lim (z - a)p(z) = lim A„ + A b (j^fj A- A c 


= A a 


85 The pairs (a', a"), (/3\ /3"), and (y\ y") are solutions of the indieial equations at the points a, 6, and 
c, respectively. 
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and 


qo = lim (z — a) 2 q(z) = lim B a + B b (- — f) + B c (- — -) 7 77^ 

*-> a z ^ a L \z — 0/ \z — c) J (z — b) l 


b){z-c) 


Bg 

( a — b)(a — c ) 

at 2 = a, the indicial equation at this point is 


p 2 + (A a — l)p + 


B a 


(a — b)(a — c) 


= 0 


But it follows from the properties of the roots of quadratic equations that the 
roots a' and a" of this equation satisfy the relations 


a' + a"= 1 -A a 


a' a" = 


B a 


( a — b) ( a — c) 


And we can show in exactly the same way that 


p + ?=\-Ai 


073 " = 


B b 


( b — a) (b — c) 


y + y"=l -Ac 


y'y". 


B c 


( c — a ) (c — b) 


Upon using these relations to eliminate the A’s and B ’ s in equation ( 7 - 4 ), 
we arrive at the Riemann-Papperitz equation 


d*w | / l-a'-y | 1-0' -/3 H | 1 -Y-y^ dw [ [ aVjg-ft) (a-c) 
\ z — a 


az l \ z — a z — b z — c/dzl z — a 


-=0 


ppr{b-a) (b-c) | y'y" (c — a) (c — 6) 1 w 

z — b z — c J (z — a) (z — b) (z — c) 

(7-6) 
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Eliminating the A’s in equation (7-5) shows that the characteristic ex- 
ponents at the singular points of equation (7-6) satisfy the relation 

a' + a"+/3' + /3"+y' + y"= 1 (7-7) 

More generally, the sum of all the characteristic exponents at the singular 
points of any second-order Fuchsian equation is two less than the number of 
such points. This relation is called the Fuchsian invariant for the equation of 
order 2. 

In a similar way, it can be shown that if a Fuchsian equation with, at most, 
three regular singular points, has one of these points, say c, located at z = °°, 
this equation must be of the form 


dfw / l-a'-a” . l—f'—f" \ dw 

dz 2 \ z — a z — b j dz 


+ f a'q" (a — 6) + fi'ITQ-a) + , ' 
L z—a z—b 


w 


(z — a) (z — b) 


= 0 (7-8) 


where y' and y" still denote the characteristic exponents at c and equation 
(7-7) still holds. Since equation (7-8) can be obtained by formally taking 
the limit c— in equation (7-6), we can say that equation (7-6) holds even 
when c— °°. Then with this understanding, equation (7-6) is the most general 
Fuchsian equation which has, at most, three singular points. 

7.2 RIEMANN P-SYMBOL 

When w is a solution of the Riemann-Papperitz equation whose singulari- 
ties occur only at the distinct points a, b, and c, we shall sometimes write 


w = 



b 

(3' 

13" 


y' 

y" 


(7-9) 


The right side of equation (7-9) is called the Riemann P-symbol. The char- 
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acteristic exponents of the differential equation are listed below the correspond- 
ing singular point. (Any of the singular points can be at infinity. I These ex- 
ponents must, of course, satisfy the Fuchsian invariant 


a' + a"+(3' + (3"+ y' + y”=l 


Finally, the independent variable is located in the fourth column. 

If we interchange any of the first three columns in the P-symbol (7-9), 
the notation still refers to the same equation, There are 3!=6 ways in which 
this can be done. Similarly, interchanging the order of the exponents in any 
given column leaves the meaning of the symbol unchanged. Thus, for each 
arrangement of the points a, b , and c, there are eight arrangements of the 
exponents. Hence, there are 6X8 = 48 different P-symbols which refer to the 
same equation and, therefore, have the same meaning. 

We shall now show that 



b c 

(3' y' + k 

f3" y"+k 


z \ (7-10) 


/ 


when a, b, and c are finite and 



00 


/ 3 ' y' + k 

13" y"+k 


(7-11) 


when c = <x>. 

Equation (7-10) means that if w is a solution of a Riemann Papperitz 
equation and if we make the change of dependent variable 


w = 



(7-12) 
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in this equation, then the transformed equation (which has u as the dependent 
variable) is also a Riemann-Papperitz equation and the location of the singular 
points is left unchanged by this transformation. However, the v.naracteristic 
exponents at the points a and c are altered in the manner indicated by the 
symbols. Equation (7-11) is the corresponding transformation for the case 
where one of the singular points is at infinity. 

In order to prove that equation (7-10) holds, we first substitute equation 
(7-12) into equation (7-6). Then upon noting that 


, ( z-a\ k f , k (a-c) ~ 

’ \z-c ) [ U + (z — a) (z — c) 

'"=(^ 4 * \u"+2k (— — ) 

\ z — c) l \ z — a z — c / 


k (a — c) ( k—\ 
(z — a) ( z — c ) \z — a 



u 


and 


1 l (l+^) 

(z — a) (z — c) (z — k) (z — a) (z — b) (z — c) \ z — kj 


for k = a, b, or c 


we find after combining terms that 


d*u / l-a'-a"+2 k 1-/3'— >3" l-y'-y ,, -2/6 du 

dz 2+ \ z — a z — b z — c ) dz 

, A:) (a^zk) (a-b) (a-c) . j8'j8"(6-a) (b-c) 

+ 7=~a + z-b 


t (y' + k) (y±k) ( c ~ a ) ( c ~ 6 ) 

Z — C 


u 

(z — a) (z — b) (z — c) 


= 0 


But comparing this with equations (7—6) and (7 9) shows that equation 
(7-10) holds. Equation (7-11) follows from equation (7-8) in the same way. 
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Having considered the transformations of the Riemann-Papperitz equa- 
tions under a change of dependent variable, we shall now show how this equa- 
tion transforms under a change of independent variable. In this case, the 
change of variable which transforms a Riemann-Papperitz equation into another 
Riemann-Papperitz equation is the nonsingular linear fractional tra nsformation 

t= 7^TT) where AD — BC # 0 (7-13) 


studied in chapter 5. 

It is again convenient to describe this transformation in terms of the Rie- 
mann P-symbol. Thus, if t is the change of variable (7-13), then 



where a u bu and c x are the images of the points a, b, and c, respectively, under 
equation (7-13). For example. 


Aa + B 

a,- Ca + Z) 

In order to prove equation (7-14) we first recall that (see section 5.3) per- 
forming the transformation (7—13) is equivalent to performing in succession 
not more than four elementary transformations, each of which has one of the 
forms 


t—lz 

(7-15) 

t = k~\~ z 

(7-16) 

_ 1 


z 

(7-17) 


It can now be verified, by substituting in turn each of equations (7-15) to (7-17) 
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into equation (7-6), that in each case the transformed equation is a Riemann- 
Papperitz equation with the same characteristic exponents as equation (7-6) 
and that the singular points of the transformed equation are the images under 
the transformation of the original singular points. Hence, equation (7-14) holds 
for each of the elementary transformations (7—15) to (7—17). But since the 
general nonsingular linear fractional transformation is equivalent to performing 
a succession of these transformations, it is clear that equation (7-12) must hold 
for this transformation also. 

7.3 TRANSFORMATION OF RIEMANN-PAPPERITZ EQUATION INTO HYPER- 
GEOMETRIC EQUATION 

We shall now show that the general Riemann-Papperitz equation (7-6) 
can always be transformed by the application of a number of transformations 
(each of which has one of the forms (7-10), (7-11), and (7-14)) into a certain 
standard equation (canonical form). The usefulness of this result is principally 
due to the fact that the transformed equation, which has a regular singular 
point at the origin, has a two-term recurrence relation at this point. As we have 
seen, this allows us to find an explicit expression for the series solution. 

In order to accomplish this reduction let w be any solution to equation 
(7-6). Then 


a 


w 


= P\ a' 


\Oi 


& 

P" 



where we suppose that the notation has been so arranged that a and b are 
always finite. We first introduce the new dependent variable v by 


w= 


, (z — a\ 
(“) - 
(z — a) a 'v 


if c oo 
if c=° o 


(7-18) 


and then use equation (7-10) or (7-11) to show that 
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( a b 

o er 

oc" -a' /3" 



(7-19) 


We again change the dependent variable by using the transformation 



\(z-bYu 


if c oo 


if c = oo 


(7-20) 


and then apply equation (7—10) or (7—11) to equation (7—19) to show that 


( ° 

u = P\ 0 

Y’-a' 

Now let us use the linear fractional transformation (discussed in section 
5.3) which maps the points a, b, and c into the points 0, 1, and respectively, 
to change the independent variable. Thus, a new independent variable t is 
introduced by 


b c 

0 y' + a' + p' 

y"+a' + /3' 


(7-21) 


t= 



(7-22) 


(When c = °°, we understand t to be the limit of this expression as c— » <», 
which is simply the first term in the right-hand member.) And in view of equa- 
tion (7-14) it follows from equations (7-21) and (7-22) that 



00 


y +«'+£' 

y+<*'+/3' 


(7-23) 
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The transformations (7-18). (7-20), and (7-22) can now be collected into 
the single formula 



(z-a) a ' (z-by'u(t) for c = °° 

(7-24) 


The function u given in equation (7-23) is the solution to a Riemann-Papperitz 
equation which has only four nonzero exponents. Since the sum of these ex- 
ponents is unity, they can be characterized by three constants, say a, /3, and 
y. It is customary to define these constants by 


a = a' + f3' + y r ft — ot' + ft' + y" 


y= l + a'-a" 

( 7-25) 


Then equation (7-23) can be written as 


r 

u=p[ 0 0 

\1— y y — a—fi 


00 

a 

P 


) 


(7-26) 


But comparing equations (7-8), (7-9), and (7-26) shows that the Fuchsian 
equation satisfied by (7—26) is 


d 2 « n — (1— y) 1 — (y— a — /3)1 du [~JQ 

dt 2 + l + t— 1 J dt + [ t 


0 , 1 u 

7(7= ry— 0 


And upon rearranging this equation slightly, we obtain the hypergeometric 
equation of Gauss , 


t 


( 1-0 


dfu 

d t 2 


[y-(a + p+l) t]^-apu=0 


(7-27) 


Its solutions, which will be obtained subsequently, have been extensively 
studied. 
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We have, therefore, shown that any Riemann-Papperitz equation can be 
transformed into the hypergeometric equation (7-27) by introducing a change 
of variable of the type ( 7-24 ) . 

Hence, once the solutions to the hypergeometric equation (7-27) have 
been found, we can find the solutions w to any Riemann-Papperitz equation 
simply by using the transformation (7-24) to express them in terms of the solu- 
tions to the hypergeometric equation. 

However, in any specific case, it is usually more convenient to rederive 
this transformation by using the Riemann P-symbol. Thus, consider the 
Riemann-Papperitz equation 

2z 2 (z-2)w"-z(3z-2)w' + 2(z-l)w=0 (7-28) 


For this equation, 


P(z) 


3z — 2 
2z(z — 2) 


9(2) 


z— 1 
z 2 (z — 2) 


Hence, its singularities are located at the points z=0, z = 2, and z=°°. 
At z = 0 


,. r 3 z -2 11 ,.2-11 

Po= r 2(1=2) J = “2 ,0= lL m ,^2 = 2 

and, therefore, the indicial equation at this point is (see eq. (6-39)) 

p*-|p+5=° 

The roots of this equation are 1 and 1/2. 

At z = 2 


po=lim 
z— * 2 


(• 


3z-2 

2z 



,. (z — 2) (z— 1) . 

9o=hm- ^ -=0 


*-» 2 


and, therefore, the indicial equation is 

p 2 — 2p = 0 
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The roots of this equation are 2 and 0. 
At 2=00 


Po = lim [2 — zp(z )] 


= lim 

Z — >00 


32 — 2 
2(2-2) 


7 

2 


qo = lim z 2 q(z) — lim 


2-1 


>00 z ■ 


1 


and, therefore, the indicial equation is 

P 2 + |p + 1 = 0 


The roots of this equation are — 1/2 and — 2. Hence, the solutions to equation 
(7-38) can be denoted by 



It now follows from equation (7—11) that 



\ 


/ 


And it can be concluded from equation (7- 14) that 


/0 1 


00 


w = z ll2 P I 0 


0 


0 


Vl/2 


-3/2 


\ 

t 

/ 
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where t = zj 2. But upon comparing this with equation (7-26), we see that 

a = 0 /3 = -| y=| (7-29) 

Thus, the solutions to equation (7-28) can be expressed in terms of the solu- 
tions to the hypergeometric equation (7-27) with the constants a, /3, and y 
given by equation (7-29). 

7.4 HYPERGEOMETRIC FUNCTIONS 

It remains to determine the solutions to the hypergeometric equation 
(7—27). We shall first seek a solution about the regular singular point z= 0. 
Since the P-symbol for equation (7-27) is given by equation (7—26), it follows 
that the characteristic exponents at z = 0 are 0 and 1 — y. Suppose first that y 
is neither zero nor a negative integer. Then the methods developed in chapter 6 
can be used to obtain a power series solution 86 of the form 

U= jr dnt n (7-30) 

n = 0 


which corresponds to the characteristic exponent zero. Upon substituting this 
into equation (7—27), collecting terms with like powers of t, and shifting the 
index n, the lowest one present, we obtain 

^ n(y 4- n — l)a„t n_1 — ^ (a+ n — 1) (/3 + n — l)a n -if" 1 = 0 

n=0 n = l 

Hence, the n = 0 term shows that a 0 is arbitrary and we obtain the two- 
term recurrence relation 


_ (a + n — l)Q3 + n — 1) 

a n~ , . dn - 1 

n(y+ n — 1) 


forn=l,2, . . . 


(7-31) 


where the division by n(y+n — 1) is permitted since we have required that y 
be neither zero nor a negative integer and, therefore, the denominator can never 
vanish. Upon writing out equation (7-31) for successive values of n, multiply- 
ing the corresponding members of these equations together, and dividing 

86 Recall that it was shown in section 6.3 that the power series solution corresponding to the largest characteristic 
exponent always exists. 
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out common factors, we find that 

[a(a+l) . . . (a+n — 1)] [/3(/3+ 1) . . . (/3+n — 1)] 

°"~ »![y(y+l) . . . (y+n-1)] 

And by using the generalized factorial function defined in equation (5—33), 
this can be written in the more concise form 


O'H 


(«)n(j3)«l 
n\(y) n 


do 


Hence, the solution (7—30) is 


u=aoF(a, f$; y; t ) 


(7-32) 


where we have put 


F(a, j8; y; t) s 2 for y * 0, - 1, -2, . . . (7-33) 

jfr 0 n.{y) n 

The function F(a, (3; y; t ) is called the hypergeometric function, and the 
notation used herein is universally accepted. The reason for calling this 
function the hypergeometric function is that when a= 1 and j 3=7, this series 
(7—33) reduces to the geometric series. Thus, 

F(l, y; y; t) = Y~~ t 
In fact, it follows from the binomial theorem that 

F(a, y; y; t ) =F( y, a ; y; t) = (7-34) 


for any number a. 
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Notice that if either a or p is zero or a negative integer, the series (7-33) 
terminates after a finite number of terms and, therefore, represents an entire 
function. In all other cases, a simple application of the ratio test (ref. 23) will 
suffice to show that it converges in the circle |t| < 1 and diverges outside this 
circle. 

We have, therefore, shown that, provided y is neither zero nor a negative 
integer, 


u=F(tx,ti-,y\ t) 


(7-35) 


is a solution to the hypergeometric equation 


t{\—t)u"+ [y — (a + /3+ \)t]u' — ocf}u = 0 (7-36) 


at least in the circle |f| < 1. 

Since equation (7-36) is a Riemann-Papperitz equation, its solutions u 
can be denoted by the P-symbol 


0 


00 


u=P\ 0 


0 


a 


^1 — y y — a—p ft 


(7-37) 


Also, inasmuch as the order in which the exponents are listed in a given column 
is immaterial, we can interchange those in the first column to obtain 


0 


1 


00 


u = P\ 1— y 0 at 

0 y — a — p p 


Hence, it follows from equation (7-11) that 
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0 


1 


oo 


u=t 1 ~ y P | 0 0 a +l — y 

O' — 1 y — a — (5 P+1 — y 


(7-38) 


Notice that the P-symbol in equation (7-38) is in canonical form. It, there- 
fore, denotes a solution to the hypergeometric equation whose exponents at 
the points 0, 1, and °° are 0, y — 1: 0,y — a — /3; anda+ 1 — y, P+ 1 — y, respec- 
tively. And we can choose this solution to be the hypergeometric function F. 
Hence, upon comparing equations (7-35) and (7—37) with the P-symbol in equa- 
tion (7—38), we find that 


u=t l ~ y F(a + 1 — y, p+l—y; 2 — y; t) (7-39) 

is also a solution to equation (7-36) provided, of course, 2 — y is neither zero 
nor a negative integer. Since the hypergeometric series converges for 1 1\ < 1 , 
the function (7-39) must satisfy the differential equation in this region. Notice 
that if y= 1, the solutions (7-35) and (7-39) are identical. In fact, equation 
(7-37) shows that, in this case, equation (7-36) has equal exponents at r = 0 
and, therefore, one of the solutions composing the canonical basis at t = 0 must 
contain logarithmic terms. Next, if y is any integer other than unity, either 
y or 2 — y is either zero or a negative integer. Thus, either the hypergeometric 
function (7-35) or the hypergeometric function (7-39) is undefined. The remain- 
ing equation provides a solution to the differential equation. Finally, if y is not 
an integer, both equations (7-35) and (7-39) are solutions to the differential 
equation in the circle |f|<l. Since the series expansion for equation (7-35) 
begins with t° and that for equation (7-39) with t x ~ y , they must also be linearly 
independent solutions. 

Interchanging the exponents in the second column of equation (7-37) leads 
in the manner described above to the solution 

u= (*-l)v-«-0F(y-/3,y-a; y; t) (7-40) 

in the domain |f| <1. And interchanging both the exponents in the first column 
and those in the second column leads to a fourth solution 


u=t t - y (t-l) y - a -(*F(l-p, 1 — a; 2 — y; t) (7-41) 


218 


RIEMANN-PAPPERITZ AND HYPERGEOMETRIC EQUATIONS 


in the domain |t| <1. But, at most, two of these four solutions can be linearly 
independent. If y is not an integer, there exists a linear relation connecting 
any three of them. And, if y is an integer, the two solutions which exist can differ 
only by a multiplicative constant. 

Another set of four solutions can be obtained by interchanging the points 
0 and 1 by means of the transformation z= 1 — t. Thus, for example, equation 
(7-37) is equivalent to the equation 




Upon applying equation (7-11) to this equation twice in succession, we obtain 


/ 

1 

0 

“ \ 

u= (t-l)Y-«-^i-vp| 

0 

i 

O 

^ 7 


a + /3 — y y — 1 1 — a } 


It now follows from equations (7-13) and (7-14) that the transformation z= 1 — t 
maps the points 1, 0, and <» into the points 0, 1, and », respectively, and leads 
to the expression 


0 


1 


oo 


u={t-\)v- a -H'-yP\ 0 0 1 — >3 1 -t 

W+/3 — y y— 1 1— a 


Since this P-symbol is in canonical form, it represents a solution to a 
hypergeometric equation which we can again choose to be F. Thus, we find 
after comparing exponents with equations (7-35) and (7-37) that 


u= 1 — a; 1 + y — a— /3; 1 — t) (7-42) 
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Notice that the hypergeometric series in the solution (7-35) converges in 
the circle |t| < 1, while that in the solution (7-42) converges in the circle 
— 1 1 < 1. By systematically exploiting the technique used above, Kummer 
(1810 to 1893) obtained 24 solutions to the hypergeometric equation (7-36), 
each of which is expressed in terms of the hypergeometric function F. In 
each of these solutions the variable is one of the quantities t, 1 — t, (t — l)/t, 
l/t, and (1 — t) -1 . Since the function F with variable t converges 
in the unit circle |l| < 1, these solutions will converge in one of the regions 
shown in figure 7-1. Each series will converge in the appropriate shaded region 
and diverge outside it unless the series terminates; in which case, it will 
converge in the entire plane. Of the 24 solutions, four converge in each of 
the regions shown. Of course, only two solutions can be linearly independent 
in any given region. Some of the regions shown in the figure overlap. Hence, 
there are numerous relations among the various 24 solutions obtained by 
Kummer. These relations not only provide useful identities among hyper- 
geometric functions but also provide formulas which can be used to ana- 
lytically continue the solutions of the Riemann-Papperitz equation from one 
region of the plane to another. Thus, it can be shown, for example, that when 
y is neither zero nor a negative integer, the identity 


F(a, (3; y; t) = (1 - f )v-«-*F(y-a, y-p; y; t) (7-43) 


holds in the domain |;| < 1. An example of an identity which holds in the inter- 
section of the regions shown in figures 7-1 (a) and (b) is 

F (a,p;y;t) = (l-t)~ a F (a,y- p;y;jzT[) for 1*1 < 1 and 1*1 < l 1- *l 

It is frequently convenient to express the hypergeometric function as an 
integral. This can be done when y > 0L P > 0 and \t\ < 1. In order to arrive 
at this result, we substitute equation (5-35) into equation (7-33) to obtain 


F 


(a,p; y; t) = 



(q)n T (j8+ n) 
n\ r (y -+- n) 


t n 


(7-44) 
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Since it follows from equations (5-37) and (5-38) that 


r Q 3+n) T (y-/3) 
T ( y+n ) 


B (/3+ n,y — 



r /3+n-l 


(\—T) y ~P~ l dT 


for^y>^/3>0 


equation (7-44) can be written as 


F («.£; y; 0 = TWr^=J)| 0 ( -^‘"/„' t»—(i 


Upon interchanging the order of integration and summation (see section 
5.6), we find that 


F (a, /3; y; t) = 


r(y) 


ros) r (y 






But the series can be summed by using equations (7—33) and (7—34) to obtain 
the integral representation 


F (a, P; y; t) = r^r^y-p) J 0 ^'(l- r)^" 1 (1 ~Tt) “dr 

for 0?Ly > &L (3 > 0 and |f | < 1 

which was discovered by Euler. Since this integral exists and represents an 
analytic function for all values of t except where t is real and larger than 1, 
it provides an analytic continuation of F(a , /3; y; t) from the unit circle 
jf | < 1 to the entire t-plane cut along the positive real axis. Barnes obtained 
the more general integral representation 


F {a,p; y; t ) 


1 T(y) p°° T (a + z) T (/3 + z) T (— z) 

mtmtwu nv+n 


(~t) z dz 


where |arg (— f)| < tt and the integration is to be carried out along any path 
which lies to the right of the poles of T(a + z) T(/3 + z) and to the left of the 
poles of T (— z) . 
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The six functions F(a± 1, P; y; t), F(a, /3± 1, y; t), and F(a, /3; y± 1; f) 
are said to be contiguous to F(a, /3; y; t). Gauss proved that there is a linear 
relation between F(a, p; y; t) and any two of its contiguous functions, the 
coefficients being linear polynomials in t. Some examples of these relations are 

y(l-t)F(at,P;y;t) = (p-y)tF(ac,p;y+l;t)+yF(a- l,p;y;t) 

0 p-a)F(a,P;y;t)=pF(oi,p+l;y;t)-aF(<x+l , p\ y t) 

The derivative of a hypergeometric function is also a hypergeometric 
function. In order to show this, notice that the series (7—33) can be differentiat- 
ed term by term within its circle of convergence to obtain 

dF = * (q)n (£),<"-» 

dt 2/j (y)„(n-l)! 


Upon shifting the index by putting k=n— 1, this becomes 


dF = y (oQfc+i(/3)fc+i 

dt (y) fc+ iA:! 


(7-45) 


But since it follows from definition (5-33) that (a)fc+i = a(a+ l) fc , (P)k+i — 
P(P+l)k, and (y) fc +i = y(y+l)fc, equation (7-45) shows that 


d_F{a , )3; y; t) __ap 
dt y 


F(a+ 1, j3+ 1; y+ 1; t) 


(7-46) 


More generally, the Jacobi identity 

^_[ i a+n-i/r( Q! , y; t)] = (a) n j“- 1 F(a + n, P; y; t ) ( 7-47) 


can be established by multiplying both sides of the identity by f 1_ “ and then 
verifying that the left side satisfies the hypergeometric equation with param- 
eters «+ n, p, y. 
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There are many other relations connecting the hypergeometric functions. 
The reader can find these tabulated in reference 26 and discussed in some 
detail in reference 27. Many of the known functions, such as the elliptic inte- 
grals of the first and second kind, the inverse sine, etc., can be expressed as 
hypergeometric functions. 

7.5 GENERAUZED HYPERGEOMETRIC FUNCTIONS 

The concept of hypergeometric function can, itself, be generalized. To 
this end, notice that in the definition of the hypergeometric function 


F 


(a,/3;y;z) 



(0t)n(P)n 

(y)nn\ 


the numerator and denominator of the coefficient of z n /nl contain only products 
of generalized factorial functions. Hence, it is natural to define the generalized 
hyper geometric function pF q by 


pF q {ot\, a 2 , 


ot p ; 71,72, 


•.7«;z) = j£ 

n=0 


(<*l)n(Q!2)n • • 
n! ( 7 i)« (72)11 


(o?p)n£ n 

■ ( 7 o)n 

(7-48) 


provided the series converges. 87 The numbers p and q are nonnegative integers 
which denote the number of parameters in the numerator and in the denomi- 
nator, respectively. Thus, the ordinary hypergeometric function F is a 2 F i- 
Since the parameter /3 cancels out in the numerator and denominator in 
equation (7-34), we see that 


where we indicate that there is no parameter in the denominator by 
inserting a dash. The function ii^i is treated in the next chapter. The generalized 
hypergeometric functions are studied in considerable detail in reference 27. 


87 This series will always converge in the unit circle if g — pd- 1, and it will converge in the entire plane if q >P + 1* 
The other cases are more complex. 
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7.6 LOGARITHMIC SOLUTION TO HYPERGEOMETRIC EQUATION 

We have found two linearly independent solutions to the hypergeometric 
equation (7-36) in a neighborhood of t = 0 for the case where y is not an 
integer. However, when y is an integer, only one solution has been found. 
The remaining solution can be found by the methods of chapter 6. Thus, for 
example, when y is a positive integer, Ui = F (a, y; t ) provides one solution 
to the hypergeometric equation (7-36); and the methods of chapter 6 show 
that if neither a nor /3 is a positive integer smaller than y, a second linearly 
independent solution is given by 


« 2 


-F (a, for, 1) l nt ~ (l-a)l 1 (l-ff), t , t - 1 

+ t [« (a, n)+H (0, n)—H (y, n) —H (l,n)] t* 

n=l n - \ r)n 


where the finite sum is to be omitted when y= 1 and we have put 


88 


n-1 


H(a. n) = X “ 

fc =0 u 


+k 


We can obtain the second solution for the case where y is zero or a negative 
integer in the same way by starting with the solution 

ui = t l ~ y F(a+ 1 — y, /3+ 1 — y; 2 — y; t ) 


7.7 SPECIAL RIEMANN-PAPPERITZ EQUATIONS 

The classical Jacobi differential equation 


(1 — z 2 )w" + [6 — a — (a+6 + 2)z]tt/ + n(n + a + 6+l)u; — 0 (7—49) 


88 The function defined in eq. (6-89) is identical to //( 1 , n). 
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is a particular Riemann-Papperitz equation with singular points atz = ±l, 
It is readily established that in this case 


w=P\ 0 


— a 


— 1 ^ 

0 — n 

—b n+a+6+1 



(7-50) 


This equation can be put into canonical form by the change of independent 
variable t= (1 —z)l 2. Thus, 



1 oo 

0 — n 

— b n+a+6+1 



Therefore, when a is not a negative integer, a particular solution wi to the 
Jacobi differential equation is 

w i — F ^ — n, n+a+6+1; a+1; ^ (7—51) 

Equation (7-51) satisfies the differential equation within the circle |1— z\ <2 
for all values of the parameter n. But if n is a nonnegative integer, the hyper- 
geometric series terminates after a finite number of terms. For such values of 
n, the Jacobi polynomial is defined as 

/*«.ft>(z) = ( a ~t"^)” f n -\- a + 6+1; a+1 forn = 0, 1,2, . . . 

(7-52) 

A convenient expression for these polynomials is provided by the Rodrigues 
formula 

P(, a * 6) (z)=-^|^(l-z)-«(l + z)- 6 -^[(l-z) fl+ ”(l+z) 6+n ] (7-53) 
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This formula was obtained for the special case of Legendre polynomials (in- 
troduced below) by Olinde Rodrigues in 1816. In order to derive this result, 
we use equation (7—34) together with the Jacobi identity (7—47) to show that 


*-«(!-*) 


d n 

dt n 


[(1 -t) b+n t a+n ] 


= t -a (l — [t a+n F (a+1, — n — 6;a+l;t)] 

* (a+l)„(l — t)~ b F (a+l + n, — n — b;a+\;t) (7-54) 

But it follows from equation (7-43) that 

(1 — t)~ b F (a + 1 + n, — n — 6; a + 1; t) =F ( — n, a + 1 + n-\- 6; a + 1; t) 

Hence, after substituting this into equation (7-54) and making the change of 
variable t= (1— z)/2 in the resulting expression, we find, upon comparing the 
result with definition (7-52), that the Rodrigues formula (7-53) holds. 

A number of important equations which have been extensively studied 
are special cases of Jacobi’s equation. Thus, when a=b, equation (7-49) 
reduces to the ultraspherical differential equation 


(1 —z 2 ) w"— 2 (a+1) zw' + n (n + 2a+l) u/=l ) (7-55) 

A solution to this equation is given by equation (7—51) with a = b. But, 
when n is a nonnegative integer and the series terminates, 89 it is customary to 
use a different normalization factor from the one used in equation (7-52) and 
to express the solution in terms of the Gegenbauer polynomial 

CZ(z) = 7 ■ />L«-(l/2).a-(l/2)]( 2 ) ( 7 _ 56 ) 

( a+ ^ 


89 The polynomial obtained directly from eq. (7-52) with a = b is sometimes called the ultraspherical 
polynomial . 
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where we have put a = a+ (1/2). 

The Tschebychev equation 

(1 — z 2 )w" — zw' + n 2 w = 0 (7-57) 

is, in turn, a special case of the ultraspherical equation (7-55) with a = 
— 1/2. One solution to this equation is therefore given by equation (7-51) 
with a=6= — 1/2. However, when n is a nonnegative integer, it is again 
customary to change the normalization and define the Tschebychev polynomial 
(of the first kind) T n in terms of the Jacobi polynomial (or ultraspherical poly- 
nomial) by 90 


Tn(z) 



/>(-l/2,-l/2)( 2 ) 


Another important special case of the ultraspherical equation occurs 
when we put a= 0 to obtain Legendre’s equation 


(l—z i )w"—2zw' + n(n+l)w=0 (7-58) 


One solution to this equation is given by equation (7-51) with a = b = 0. 
Thus, we define the Legendre function of the first kind P n by 


P n (z) = F (-n, n+ 1; 1; (7-59) 

And since this function is a polynomial when n is a nonnegative integer, it is 
then called the Legendre polynomial. Thus, the Legendre polynomial is a Jacobi 
polynomial with a — b — 0. 

A second (suitably normalized) linearly independent solution to equation 
(7—58) about z= 1 is called the Legendre function of the second kind and is 
denoted by Q n • If n is an integer, Q n will involve logarithmic terms. 

Notice that Legendre’s differential equation (7-58) (and, in fact, more gen- 
erally, the ultraspherical differential equation (7-55)) is invariant when z is 
replaced by — z. This suggests that we transform equation (7-58) by introducing 


90 It is not hard to show that T n satisfies the relation T n (cos 0)= cos n 6 , and this relation is often used to define 
this polynomial. 
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the new independent variable t = z 2 . The equation then becomes 
2t(l — t) n(n+l)w=0 

But this is again a hypergeometric equation whose parameters a, /3, and y are 
now —(1/2 )n, (l/2)(n + l), and 1/2, respectively. And since y is not an inte- 
ger, we can conclude from the results of the preceding section that this equation 
has two linearly independent solutions Wi and w<i about t = 0 which are given by 
equations (7-35) and (7-39), respectively. Thus, 

<7 - 60) 

r, ( n+1 n + 2 3 \ 

= 2 ’ 2 ’ 2 ’ / (7 - 61) 

They converge at least within the circle 1*1 < 1. When n is a nonnegative 
integer, the series for Wi will terminate for even values of n and that for w 2 
will terminate for odd values of n. However, the polynomials obtained in this 
manner must be identical, to within a constant factor, with the Legendre 
polynomials P n {z). 

The associated Legendre equation 


(1— z 2 ) w" — 2zw' -f j^n. (n+ 1) — w=0 (7-62) 

reduces to Legendre’s equation when m = 0. It is a Riemann-Papperitz equa- 
tion for all values of m. Its P-symbol is 


/ 1 

u’ = P I (1/2) m 
V (1/2) m 


-1 

00 

(1/2) m 

n 

— (1/2) m 

ri+ 1 


) 


But upon transforming this to normal form, we find that 
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w= (z 2 -l) m/2 P 


(' 


V 


m 


0 

0 

— m 


00 

m — n 

m + n+ 1 


If we let u be the transformed dependent variable, then 

w= (z 2 ~l) ml2 u 


(1 — z)/2 


(7-63) 


and 



0 

00 

0 

m — n 

— m 

m + n+ 1 

-1 

00 

0 

m — n 

— m 

m+ n -\- 1 


(1 — z )/2 


•) 


Hence, u satisfies the equation 

(1 — z 2 ) u "— 2 (m + 1) zu' + (n — m) (n + /n+l)u=0 (7-64) 

But when Legendre’s equation (7—58) is differentiated m times, we obtain 

fjm+2.., d m+l W . d m W 

(l-* 2 ) -2 (m+1) z ^.+1 + (n-m) (n+m+l)-^r = 0 

(7-65) 

Since P n and Q n are two linearly independent solutions of equation 
(7-58) about z=l, we see upon comparing equations (7-63), (7-64), and 
(7-65) that 


d m 
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and 

d m 

are two linearly independent solutions to equation (7-62) about z= 1. They 
are called the associated Legendre functions of the first and second kind, 
respectively. 
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CHAPTER 8 


Confluent Hypergeometric Equation and 
Confluence of Singularities in 
Riemann-Papperitz Equation 


Having studied equations whose only singular points are regular singular 
points, it is natural to approach the study of equations with irregular singular 
points by applying a limiting process wherein two or more regular singular 
points of an equation are allowed to approach one another. The process, 
whereby two or more singular points in any linear differential equation are 
allowed to come together in such a way that at least one of the corresponding 
exponents becomes infinite, is called confluence, provided the limiting form of 
the differential equation exists. 

8.1 CONFLUENCE OF SINGULARITIES IN RIEMANN-PAPPERITZ EQUATION 

We shall apply this process to the Riemann-Papperitz equation whose 
singular points are located at 0, b, and 00 by letting the regular singular point 
b approach the regular singular point at °°. Now if we are going to allow the 
exponents at the points b and °° to become infinite, we must prescribe the 
manner in which they approach infinity. This can be done by letting these 
exponents be functions of b whose values approach infinity as b —■ * °°. If we 
suppose that the exponents are polynomials in b, it is not hard to show that the 
requirement that the limiting form of the differential equation exist implies 
that the exponents be, at most, linear in b. Now equation (7—8) is the general 
Riemann-Papperitz equation with one of its singular points at °o. If we require 
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that the two singular points in the finite part of the plane are at 0 and b and if 
we suppose the exponents at b are (3[ + bfi and £}[' + bfy' and those at oo are 
y'l + byz and y[' + by ' 2 ' , this equation becomes 


r 1 -a' -a” l- j 8;- i 8r-M^ + /3r) 

dz 2 z z — b 


dw 

dz 


+ [ (yl+h0 w+ W)1 tt— r =0 

L 2 b — z J (b-z)z 

(8-1) 


But if the coefficient of w is to remain finite as b ~ ► o°, we must require that 


0202 “7272 =0 


(8-2) 


Then upon taking the limit in equation (8-1), we obtain the equation 


<Pw 


dz 2 


+ (ft+ft' + 1 a ' a " ) 


dw 

dz 


+ 


I" a'a' 


+ 020 


(a' — l)02' + a"02 + ai 


w= 0 (8-3) 


where we have put 


«! = 7(72' + 7( '72 + ( 1 - - 0; )0r - ( «" + 0D02 


This equation has a regular singular point at z — 0 and an irregular singular 
point at z = oo. 
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8.2 TRANSFORMATION TO RUMMER'S CONFLUENT HYPERGEOMETRIC EQUA- 
TION 

Making the change of dependent variable 

w=z a "e-M g u (8-4) 

transforms equation (8-3) into the equation 

i^+b- <«'-«)*] 

where we have put 

y=l + a" — ot' (8-5) 

And, if 91 ^ , the change of independent variable 

t= (&'-&) z (8-6) 

leads to Kummer s confluent hypergeometric equation 

< 8 - 7 » 

where a — afl — fS 2 )- 

Thus, once the solution to equation (8-7) is known, we can find the solu- 
tions to any equation of the form (8-3). 

8.3 SOLUTIONS TO RUMMER'S EQUATION: CONFLUENT HYPERGEOMETRIC 
FUNCTIONS 

We shall, therefore, seek a solution to Kummer’s equation about the regu- 
lar singular point f = 0. Just as in the case of the hypergeometric equation of 

91 If /3" = /3\ the change of independent variable z= (r^lhai) and the change of dependent variable u = e~ (ll2>T v 
2 2 

lead to the equation T(d 2 vldT 2 ) + (2y — 1 — r) ( dvfdr ) — [ (2y — 1 )/2]v = 0, which is a special case of eq. (8-7). 
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Gauss, the characteristic exponents at this point are 0 and 1 — y. We again 
suppose first that y is neither zero nor a negative integer. Then equation (8-7) 

cx 

has a solution of the form u\ = ^ a n t n , which corresponds to the exponent 

n = 0 

p = 0. And we find, in the usual way, that 


^ a n (n — l + y) nt"- 1 — ^ a n -i(n — l + a)t n ~ 1 = 0 

n - 0 n= 1 


Therefore, ao is arbitrary and the recurrence relation is 

(n— 1 + a) r „ 

«. = „ ( „_ 1+r) for n = ] 2 

which can be solved to obtain 


(«)n 

!(r)» 


do 


for n= 1,2, . . . 


Hence, by using the generalized hypergeometric notation, the solution U\ 
can be expressed in the form 


ui = a 0 iFi(a; y; t ) 


where the function 


iFi{a; y; t) = J 


(*)n 
o n -(y)n 


t” 


(8-8) 


(8-9) 


is called the Pochhammer -Barnes confluent hypergeometric function. Since the 
only singularities of equation (8-7) are at r = 0 and t = o o. the solution (8-9) 
must be an entire function. 

Other commonly used notations for i-Fi(a; y; t) are M(a; y; t) and 3>(a; 
y; t). A complete discussion of confluent hypergeometric functions is given in 
reference 28. 
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Notice that by putting f = z//3 in the hypergeometric equation of Gauss 
(eq. (7-36)), we obtain the equation 

2 ( 1 _ f)§ + [ r_( “ +/ 3 + 1 ) |]f _ ““ =0 

And upon taking the limit £ — * 00 in this equation, we arrive at the confluent 
hypergeometric equation (8—7). On the other hand, by putting £ = z/£ in the 
hypergeometric function of Gauss (defined in eq. (7-33)), we obtain 


F 



(<*)n(£)n „ 

n'-(y)np nZ 


And since 


lim 

£-►00 


(£)n 

£ n 


1 


we see that at least formally 

hm F ^a, £; y; = iFi(a; y; z) 

In a similar way, if y is not a positive integer, consideration of the series corre- 
sponding to the exponent 1— y shows that 


u 2 = t l - y iFi(a + 1 — y; 2 — y; t) 


( 8 - 10 ) 


is also a solution to equation (8-12) 

Thus, if y is not an integer, equations (8-9) and (8-10) provide a funda- 
mental set of solutions to equation (8-7). But if y is equal to 1, these two solu- 
tions are identical. And if y is an integer other than 1 , one of the hypergeometric 
functions in equations (8-9) and (8-10) is undefined and the other one provides 
a solution to the differential equation. However, when y is a positive integer and 
a is not a positive integer smaller than y, the other linearly independent 
solution is given by 
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7-2 

u=iF t (a; y; t) In t— ^ 

n= 0 


n!(l — y) n +i 

(1 — a) n +i 


f-(n+l) 


+ | [H(a, n) -ff(y, ») -tf (1, n)]l“ 

where the finite sum is to be omitted when 7=1 and H(a, n ) is defined in 
section 7.6. A corresponding result holds when 7 is a nonpositive integer, and 
the remaining cases should be treated individually by the methods of chapter 6. 

8.4 WHITTAKER'S FORM OF HYPERGEOMETRIC EQUATION 

There is another standard form into which equation (8-7) (and, therefore, 
as a consequence, eq. (8-3)) can be transformed. To obtain this equation, we 
make the change of variable 


W=t(my e -(mt u 


in equation (8-7) to obtain the equation 


<PW 


1 


dt 2 t 




In order to introduce standard notation, we put 

, 1 

k= 2 y — at 


and 


m ~2 (y -1 ) 


to get Whittaker s confluent hypergeometric equation 


( 8 - 11 ) 


( 8 - 12 ) 


(8-13) 


(8-14) 
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(PW 

dt* 


/ 



W= 0 


(8-15) 


It follows from equations (8-8), (8-10), (8-11), (8-13), and (8 14) that if 
2m is not an integer 

M k , m(t) = t'"+(i/2) e -(i/2)< X F X + 2m+l; t) (8-16) 

and ) are a fundamental set of solutions to equation (8—15) in the 

domain 0 < |t| < °°. 

The notation M k , m is used in reference 25. The Whittaker’s function 
W k , m is defined by 


r,,.w= , r( ~ 2m) -^M t ..w + T £ ^ gsL i 7W».-.(0 

r(— m — A: + ^ j — Ac + 2 j 

Since Whittaker’s equation (8-15) is unaltered if we replace Ac by — A: 
and t and — f, it follows that 

W — M- k , m (— t) and W= M- k ,-m(— t) (8-17) 

are also solutions to equation (8-15) in the region 0 < |t| < oo. And since this 
equation can have, at most, two linearly independent solutions, there must be 
constants C\ and C<i , such that 


4/-fc,m( — t) — CxM kt m (t) + C 2 M k , -m(t) (8 18) 

provided 2m is not an integer. But equations (8-9) and (8-16) show that 
M k , m {t ) and m (— t) behave like t m+(1/2) , that M k , behaves like t~ m+ill2) 
in the neighborhood of t = 0, and that 


239 



DIFFERENTIAL EQUATIONS 


(— f)m + (U2)M-k,m(t) ^ fm+ ( 1 /2)^fc.'«(0 ~ 1 

Hence, equation (8-18) can hold only if C 2 = 0 and C i = (— l) m+(i/ 2 ) > Therefore, 

But substituting equation (8-16) in this equation shows that 

e< *^‘(2 ^ 1 — = 1-^1 Q — k + m; 1 + 2m; t^ 

And upon using equations (8-13) and (8-14) to express k and m in terms of 

a and y, we obtain Kummer’s first formula 

1 F 1 (a; y; t) = e ( 1 F 1 (y — a; y; — t) (8-19) 

This equation holds even when y is a positive integer, for in this case, we 

could have replaced M k ,_ m by the appropriate logarithmic solution and 
carried through the proof in the same way. 

8.5 LAGUERRE POLYNOMIALS 

If the parameter a in equation (8—7) is zero or a negative integer, the 
solution iFi(a; y; t ) (defined in eq. (8-9)) terminates after a finite number 
of terms and becomes a polynomial. This polynomial differs from the gen- 
eralized Laguerre polynomial 

L i * ) (t) = 1^1 ( — «; 1 + 6; t) for n = 0, 1,2, . . . (8-20) 

only by a normalizing factor. When b = 0 in equation (8-20), we obtain the 
simple Laguerre polynomial L n defined by 

L n {t) = 1 F 1 (— n; 1; t) 

Equations (8-9) and (8-20) show that 
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j= o 


(—n)j(l + b) ntf 

m + b)jn\ 


And since (—n)j/n\= (— l)->/(n — j ) ! 


L ( m = 5 

j = 0 


(l + 6)n(~Q J 

(1 + b)jjl(n-j)\ 


( 8 - 21 ) 


and 


Ln{t) 


A n!(-t)J 


( 8 - 22 ) 


8.6 BESSEL'S EQUATION 

An important special case of equation (8—3) occurs when 

a" = — a' = p 

2 = i 

ai = 2i(a”+i) = (p + i)2i 


The equation 


z 2 w" + zw' + (z 2 — p 2 )w— 0 


(8-23) 


(8-24) 


obtained in this manner is called Bessel’s equation. When p is not a negative 
integer, the methods of chapter 6 can be applied in the usual way to show that 
(see section 7.5) 


oo 

2 


(— I)*z2* + P 

2 2k kUl + p) k 



(8-25) 


241 



DIFFERENTIAL EQUATIONS 


is a solution to equation (8-24) about the regular singular point 2=0. However, 
it is conventional to multiply this solution by the normalization factor 
[2 p r(l+p)] _1 and to use equation (5—35) to eliminate the generalized factorial 
function in the denominator. When this is done, we obtain the suitable nor- 
malized solution 


Wi=J p 


(8-26) 


where 


00 f_iu / 7 \2 k + p 

(8 - 27) 

is the Bessel's function of the first kind of order p. Notice that this function is 
defined even when p is a negative integer. And since z = 0 is the only finite 
singular point of the differential equation, the series must converge in the 
entire plane. Although Bessell functions were used by both Leonard Euler and 
Daniel Bernoulli before Bessel was born, the German astronomer Fredrick 
Wilhelm Bessel was the first to make a detailed study of them. 

Since the function in equation (8-25) differs from J p only by the normaliza- 
tion factor [2 p r(l + p)] -1 when p is not a negative integer, it follows that 

Jp(z ) = f ( Y +p ) oFl (~ ; 1 +p: ~~4 ) for p — 1 , — 2 , . . . (8-28) 


On the other hand, we know that equation (8-24) can be transformed into 
Rummer’s confluent hypergeometric equation (8-7) by using the change ot 
variables given by equations (8-4) to (8-6). 

The solution J p can, therefore, be expressed in terms of a linear combina- 
tion of solutions of equation (8-7). Thus, if p is not an integer, it follows from 
equations (8-4) to (8-6), (8-8), (8-10), and (8-23) that 

Jp(z) —Az p e~ lz iFi ^ P + 2 * 1 + 2p; 2iz ^ + Bz~ p e~ iz 1 F 1 ^ ^ — p; 1— 2p; 2 iz^J 
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But upon comparing the behavior of these functions at 2=0, we find, when 
p is not an integer, that this equation can only hold if B = 0 and that 
A — [2Pr(l + p)] -1 . Hence, 

« 2)= 2JF?lT^' f '(p + r 1+2p;2iz ) 

Comparing this with equation (8-28) shows that 

oFi ; 1+p;— j ^ = e~ iz iF i l + 2p; 2 iz^j 

Then by putting £=zz and a = p + (1/2), we obtain Kummer’s second formula 
oFi ; a + 1; | r) = « -{ 1 F 1 (a; 2a; 2£) 

When p is a negative integer, there is a close relationship between J p 
and J- p . Thus, suppose p = — n for n=l. 2, . . . Then since the F-function 
has poles at each of the non positive integers, all the terms in the sum (8-27) 
will vanish for k < n; and we obtain 

J ( v (D* (zy*- n 

J-AZ) klT(l + k-n) v 2 ) 

But this becomes, upon shift of index from k to k — n, 

i = V ( — l) fc+n / z \ 2 * + " 

J ~ nK } j£ 0 (k+n)\T(l + k) \ 2 / 

Hence, it follows from equation (5—36) that 

J-n(z) = (-\) n Mz) for n= 1, 2, . . . (8-29) 
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Since w\ — J p is a solution to equation (8-29) for all values of p and since the 
parameter p enters equation (8-24) only as p 2 , the function 


m=J-p(z) (8-30) 

must also be a solution to this equation for all values of p. When p is an integer, 
equation (8-29) shows that the solutions w\ and w 2 are not linearly independent. 
However, for any nonintegral value of p, consideration of the behavior of the 
functions J p and J~ p in the neighborhood of z = 0 shows that w\ and w 2 consti- 
tute a fundamental set of solutions. In order to obtain a second linearly independ- 
ent solution to equation (8-24) when p is an integer, notice that for any value 
of p 


z 2 Jp+zJ'p+ ( z 2 — p 2 )J p = 0 
z 2 J"p + zj'_p +(z 2 -p 2 )J-p = 0 


(8-31) 


Since the series (8-27) is uniformly convergent, it can be shown that J p , J- p , 
and their derivatives with respect to z are all differentiable with respect to the 
parameter p. Hence, upon differentiating equations (8-31) with respect to p 
and subtracting the result, we obtain 


+2</p+ {z 2 -p 2 )jp= 2 p [j p - (- 1 ) nj_ p ] ( 8-32 ) 


where we have put 




dJ_p 

dp 


(_!)„ 


dj_-p 

dp 


By taking the limit as p—*n in equation (8—32) and using equation (8—29), 
we find that the Bessel function of the second kind 


=i a™ (8 _ 33) 

TT p—*n l dp dp J 

is a solution to equation (8-24) with p= n. But upon substituting equation (8-27) 
into this expression and noting that 
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lim for £ = 0, 1,2, . . . 

z-*-A P(z) dz 

it follows after some algebraic manipulation, which we shall omit here, that 




1 (h-k- 1)! 


k\ 


(D 


2 k-n 


1 oo f_lU / z \ 2k+n 

(8 - 34) 

where the finite sum is to be omitted if n = 0, H k is defined by equation (6-89), 
and y is Euler’s constant defined by 

y= lim (H k - In k) = 0.5772156649 . . . (8-35) 


It is easy to see from equations (8-27) and (8-34) that Y n and J n are a funda- 
mental set of solutions. 

It is convenient to extend the definition (8—33) of Y n to nonintegral values of 
p. This is now done by putting 


Y P (z) 


Jpjz) COS P7T— J-pjz) 
sin pir 


(8-36) 


when p is not an integer. Then Y p and J p are a fundamental set of solutions. 
And both the numerator and denominator of this expression vanish when 
p is an integer. But by using L’HospitaFs rule, we find that for n = 0, 1, 2, . . . 


lim Y p (z) = Urn Jp p7r J ~ p 

p-*n ^ p-n Sin pTT 


lim 

p—*n 


djj 

dp 


cos 7 rp ■ 


. dj~i 

dp 

77 COS pTT 


■ttJ p sin prr 


= Urn ( — ^ ) =- Urn f ^2- (- 1)" ^=2 1 

77 p->n \ dp COS p77 dp / 7T p-n \_ dp dp J 

And comparing this with equation (8-33) shows that definition (8-36) is indeed 
an extension of definition (8 — 33). 
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For large values of z with |arg z\ < n, the Bessel functions J p and Y p 
behave like 

} (8-37) 

However, it is sometimes convenient to have solutions to Bessel’s equation 
which tend to zero exponentially as |z| — ^ 00 in the entire half plane^Ji^z > 0 
or in the half plane ^ z < 0. But since equations (8-37) show that for large z 

J P (z)±iY p (z ) ~ ^ ^ 1/2 e ±Hz-(i™i2)-("l4)] 

we see this property is possessed by functions J n ±iY p . For this reason, the 
Hankel functions of the first and, second kind H ^ and respectively, are 
defined by pp 


H^(z)=J p (z) + iY p (z) 


Hf(z)= J p (z)~iY p (z) 

It is not hard to show that these solutions are a fundamental set. 

In a number of applications, we encounter the equation 

z 2 w" + zw' — (z 2 + p 2 )w= 0 (8-38) 

which is obtained from Bessel’s equation by replacing z by iz. Although J p {iz) 
3nd Y p {iz) are a fundamental set of solutions of equation (8—38), it is convenient 
to introduce new functions which are real for real values of z (at least for real 
values of p). To this end the modified Bessel functions of the first and third 
kind I p and K p , respectively, are defined by 


I P (z ) = e-^MJpdz) 
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K p (z)='fe i ^)H^(iz) = : fe i ^^ [J p (iz) + iY p (iz)] 

There are a number of useful relations between the Bessel functions and 
their derivatives. Thus, it follows from equation (8-27) and equation (5-31) 
that 

VI I (-1 )fc 2 2* + 2 P 

zjp(z) ^2^Pk\{k + p)Y{k + p) 

But upon differentiating both sides with respect to z, we find that 

i W»M ) ' =2 zlAir (t +rt w (8 - 39) 


In a similar way it can be shown that 

d 


dz 


z~ p J p (z) ——z-pJ p +i(z) 


(8-40) 


After carrying out the differentiations in equations (8-39) and (8-40), we obtain 

z ^j£=zJ P -i—pJp (8^41) 


z ^ = — zJ p +i + pjp 


(8-42) 


And by adding and subtracting these equations, we find 

9 djp . . 

dz ~ Jp - 1 Jp+1 


(8-43) 


and 


2 pj p — zj p-i d - zj p + 1 


(8^14) 
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The formulas (8-39) to (8-44) are very useful when dealing with Bessel func- 
tions. They also apply to the Bessel functions Y p , H ( J\ and H^\ However, 
they must be modified slightly for the functions I p and K p (due to the presence 
of iz). 

A much more complete treatment of Bessel functions can be found in 
reference 29. 

8.7 WEBER'S EQUATION 

Another equation, which has been extensively studied, can be obtained 
from equation (8—3) by putting 





«i = -| (8-45) 


to get 


"" + l w ' + l(| + n_ I 2 ) w=0 


(8-46) 


and then making the change of independent variable 


z= ? (8-47) 

to obtain Weber’s equation 

cPw / 1 1 \ 

ll? + \ n +2-l?) w=0 (8-48) 

This equation was first studied by Hermite (1864) and then by Weber. Its 
solutions are called parabolic cylinder functions or Weber functions. 

Since equation (8-46) can be transformed into the confluent hypergeo- 
metric equation (8-7) by the change of variables (8-4) and (8-6), equation 
(8-48) can be transformed into equation (8-7) by the change of variables (8-4) 
to (8-6) and (8-47). Hence, the solutions to equation (8-48) can be expressed in 
terms of the solutions to equation (8-7). Therefore, use of equations (8-4) to 
(8-6), (8-8), (8-10), (8-45), and (8-47) shows that a fundamental set of solu- 
tions to Weber’s equation (u>i, w 2 ) is given by 
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m = e -(mi\ Fl I; I (8-49) 

«* = e— < i/4K2£2 iFi | \ C 2 ) (8-50) 

Notice that if n is a positive even integer, the hypergeometric function in 
equation (8-49) terminates and becomes a generalized Laguerre polynomial; 
while if n is a positive odd integer, the hypergeometric function in equation 
(8-50) becomes a generalized Laguerre polynomial. Thus, it follows from equa- 
tions (8-20), (8-49), and (8-50) that, if n is a positive integer, equation (8-48) 
has one solution w 3 of the form 

w 3 = e-^Hn (0 for n = 1 , 2, . . . (8-51 ) 


where H n is the nth Hermite polynomial, which is defined in terms of the gen- 
eralized Laguerre polynomials by 

#2/c(£) = (— l) fc 2 2fc A:!Lj“ 1/2) (£ 2 ) ) 


k = 0,1,2,. . . (8-52) 


H 2k+ x (£) = (- l) fc 2 2fc+1 A:!^ (1/2) (C 2 ) 


8.8 OTHER EQUATIONS STUDIED 

Although we have been able to give a complete presentation of the gen- 
eral theory of the solutions to second-order linear equations about regular sin- 
gular points, apart from relatively trivial cases, only the solutions to those 
equations which can be transformed into the hypergeometric equations of 
Gauss and of Kummer have been exhaustively studied. The equations of 
Lame and Mathieu have also been studied, but only a small amount of knowl- 
edge of the functions defined by these equations has been obtained. The 
Mathieu equation is 


d?w 

dz 2 


f {a + k 2 cos 2 z)w=0 


where a and k are parameters, and the Lame equation is 
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(ffi'W 

+ [r] + n(n+l)msn 3 (z\m)]w=0 


where sn(z\m ) is the sine amplitude elliptic function of modulus m, and 77 , n, 
and m are parameters. When n = 2, its solutions can be expressed in terms of 
elliptic functions. For more information about these equations, the reader is 
referred to reference 3. An extremely complete discussion of Mathieu’s equa- 
tion with many applications is given in reference 30. 

All the equations which have been discussed in the last two chapters can 
be derived from a single equation with six distinct regular singular points (with 
the characteristic exponents at each singular point differing by 1 / 2 ) by allowing 
these singular points to coalesce. The coalescence of any two such points is a 
regular singular point whose exponents differ by an arbitrary amount. The 
coalescence of three or more such points will result in an irregular singular 
point. The reader is referred to reference 3, chapter 20 , for a more detailed 
study of these matters. 
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Solutions Near Irregular Singular Points: 
Asymptotic Expansions 


9.1 GENERAL CHARACTER OF SOLUTIONS 

Let zo be an isolated singular point of the differential equation 

S^ +p ( z ) ^ +q ^ w ~° o^-i) 

We have shown in chapter 6 that it is always possible to construct a funda- 
mental set of solutions to this equation on a punctured circular region about 
zo whenever zo is a regular singular point. And these solutions can be expressed 
in terms of certain convergent power series whose coefficients can be calculated 
successively. This occurs because the functions /i(z) and /2 (z) in the canonical 
basis (6-20) can have, at most, poles at zo. However, this will not be the case 
when the point zo is an irregular singular point of equation (9-1) since at least 
one of the two functions /i(z) and fziz) will then have an essential singularity 
at zo. This function will therefore have a series expansion about zo of the form 
(6-21) in which infinitely many negative powers of z — Zo are present. And the 
coefficients of this series are determined by an infinite set of equations which 
cannot be solved recursively. For this reason we do not have a convenient 
method of finding the corresponding solution. In addition, the characteristic 
exponents are determined by a transcendental equation (which usually cannot 
be solved explicitly) instead of by an algebraic equation. Finally, the series 
once found are usually slowly convergent. 
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9.2 FORMAL POWER SERIES 

Even if zo is an irregular singular point, it may still happen that the function 
/i(z) in the canonical basis (6-20) has, at most, a pole at zo, in which case it 
would still be possible to recursively construct 92 one solution about this point. 
And this solution would be of the form 93 

w= (z — z 0 ) p /(z) (9-2) 

where /(z) is analytic in a neighborhood of zo, and therefore has a convergent 
power series expansion 


f(z) = 2 «n(z-zo) n (9-3) 

n = 0 

about zo. We assume, without loss of generality, that the exponent p is adjusted 
to make ao # 0 and therefore to make 

/(z 0 ) ^ 0 (9-4) 

A second solution could then be found, at least in principle, from equation 
(6-63). 

When p(z) has an essential singularity at zo, equation (9-1) can possess 
a solution of the form (9-2) only when q(z) is equal to zero or has an essential 
singularity at zo- But suppose that p(z) has a pole at z 0 , say of order m. Thus, 

p(z) = (z— zo) -m P(z) (9-5) 

where P(z) is analytic at z = zo- Now if equation (9—1) possesses a regular solu- 
tion of the form (9-2), this solution together with (9-5) can be substituted into 
equation (9—1) to obtain 


92 If it were the function fi{z) which had the pole at z o and the function fi{z) which had the essential singularity, 
then either we could (when a — 0) change the notation so that the function ft(z) had the pole at zo or (when a # 0) the 
solution u)z would depend upon /i(z) (through wi{z )) and could therefore not be constructed recursively. 

93 Recall (ch. 6) that a solution of this type is called a regular solution. 
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„ (A /"(«) 2 p /'(z) p(p-l) 

/(z) z-zof(z) (z-z 0 ) 2 

— r P -1 P( z ) (9-6) 

|_(z-z 0 ) m f(z) + (z-z 0 ) m+1 J (Z) ( } 

And since /(zo) ^ 0, it follows that the functions /"//, / //, an d F* are analytic 
atz=zo- 

First, suppose that m is either zero or 1. Then equation (9-6) shows that 
q(z ) has, at most, a pole of order two. We therefore conclude that when p(z) 
has, at most, a simple pole at z o, equation (9-1) will possess a regular solution 
of the form (9-2) if, and only if, z o is a regular singular point. 

Now, suppose that m > 1. Then equation (9-6) shows that q(z) has, at most, 
a pole of order m + 1. Hence, we conclude that when p(z) has a pole of order 
m > 1 at a point zo, equation (9-1) can possess a regular solution about this 
point only if q(z) has, at most, a pole of order m + 1 at zo- 

The corresponding result for the case where the point zo is at oo can be 
obtained in the usual way by applying these results at the origin of the £-plane 
to the coefficients of the transformed equation given by equations (6-28) and 
(6-29). This leads to the conclusion that if p(z) has a pole of order 94 m 2= 0 
about z=°°, equation (9—1) possesses a solution about this point of the form 



n=0 14 


only if the highest power of z which occurs in the Laurent series expansion of 
q(z) about z= oo is m— 1. 

Now suppose that zo is a finite point, p(z) has a pole of order m at zo, and 
equation (9-1) possesses a solution of the form (9-2) about zo. Then, q(z) must 
be of the form 


q(z) = (z-zo)~ m - 1 Q(z) 


(9-7) 


94 Recall that we say that a function has a pole of order zero at a point if it is analytic at that point. 
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where Q{z) is analytic at z = zo. This function Q(z ) and the function P(z ) 
which appears in the representation (9-5) of p(z) can be expanded about z o 
in the convergent power series 


P(z) = J Pn(z Z 0 ) n I 

n = 0 I 

(9-8) 

Q(z)= j? Qn(z-Zo) n I 
n = 0 / 


where p o ^ 0. 

We can now proceed much as we did in chapter 6 for the case of a regular 
singular point by substituting equations (9-2), (9-3), (9-5), (9-7), and (9-8) 
with m> 1 into equation (9-1) and then collecting the coefficients of like 
powers of z — zo- When this procedure is carried out, we find first that the 
indicial equation is not quadratic (as in the case of a regular singular point) 
but that it is an equation of the first degree which is 


ppo + qo = 0 (9-9) 

This is consistent with the fact that, at an irregular singular point, at most 
one solution of the form (9—2) can occur. 

Next, the coefficients of the series (9—3) can still be calculated recursively, 
but the resulting series will now either terminate after a finite number of terms 
or else it will diverge for all |z — zo| > 0 (ref. 3, p. 421). Thus, a regular solution 
will only exist in the exceptional case when the series (9-3) contains a finite 
number of terms. 

For example, the equation 


„ , „ (z — o) , 1 / ,3 ) _ n 

w + 2 - — - — w \ a + - z) w — 0 

z 2 z 3 \ 4 / 

has an irregular singular point at z = 0. And it has a series solution about this 
point given by 

w=z ~ m { 1 ~h z ) 
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which terminates after two terms. 

Even though it is possible to calculate as many terms as desired when the 
series does not terminate, the solution obtained in this manner will not have 
any meaning in the usual sense since the series does not converge. This should 
not be taken to mean, however, that this “formal” solution is not useful. 

9.3 ASYMPTOTIC CHARACTER OF SOLUTIONS 

In order to introduce the concepts which allow the formal solutions 
obtained in the preceding section to be utilized, it is convenient to consider 
an example given by Euler in 1754. Thus, the differential equation 

w" + — ^, - 1 iv' + — w = Q (9-10) 

Z z 

has an irregular singular point at the origin and a regular singular point at 
infinity. Since the order of the pole of the coefficient of w at z = 0 does not 
exceed the order of the pole of the coefficient of w' at z = 0 by more than 1, we 
know that equation (9-10) possesses one (and only one) formal solution about 
z = 0 of the form 


W = Z» £ anZ n 
n - 0 


(9-11) 


We can therefore proceed just as in the case of a regular singular point and 
substitute this expansion into equation (9-10), shift the indices to the lowest 
one present, and collect terms to obtain 


jr (n + p) 2 a„_iz n+p-1 + ^ ( n + p)anZ n+p 1 — 0 

n=l n=0 


Then by equating the coefficients of the various powers of z to zero, we find 
that since ao # 0 


p = 0 


(9-12) 
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and 


(n + p) 2 a n - 1 = — (n + p)a n for n = 1, 2, 3, . . . (9-13) 

The indicial equation (9-12) is consistent with the general indicial equation 
(9-9) since, in this case, qo — 0. When equation (9-12) is substituted into the 
recurrence relation (9-13), we obtain 


a n = — na n -i for n= 1,2,3,. 


(9-14) 


And by proceeding in the usual way, we conclude from this, that 

a n = (— l)"n!a 0 


Finally, substituting this, together with equation (9—12), into equation (9—11) 
shows that equation (9—10) possesses the formal solution 


w = ^ (— l)"n!z n 

n= 0 


(9-15) 


A simple application of the ratio test, however, shows that this series 
diverges for all 2 ^ 0 . Hence, as we have already indicated in the general case, 
equation (9-15) is not a regular solution. However, we might anticipate that, 
since equation (9—15) was determined in a formal manner by the differential 
equation, it might still, in some sense, represent a solution to that differential 
equation. 

In order to see what this is, note that equations (5-29) and (5-36) show that 


ASYMPTOTIC EXPANSIONS 


However, when this is substituted into equation (9-15), we find that 


w = 



e~ t {—\) n z n t n dt 


And again, proceeding formally we interchange the order of summation and 
integration to obtain 


w = 



(— zt) n dt 


Finally, summing the geometric series shows that 


w 


= r e ~ i 

Jo 1 + zt 


dt 


(9-17) 


Now this integral converges uniformly 95 for all z in any region R bounded 
away from the negative real axis. In fact, differentiating with respect to z and 
interchanging the order of integration and differentiation shows that 


dw = _ [“ te~‘ 

dz Jo (1 + zt) 2 


(9-18) 


95 This means that for any positive number e, no matter how small, there exists a positive number T(e) which 
depends on € but not on z such that I f [<?-'/( 1 + zt)] dt I < € for all t ^ T(e) and all z in R. 
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Figure 9-1. — Region of convergence of 


r 

Jo 


te~ 


(1+zO 2 


dt and 




(1+zt) 2 


dt. 


But since this integral converges uniformly for all values of z in any region 
bounded away from the negative real axis (fig. 9—1), we can conclude that the 
derivative exists and that interchanging the order of integration and differentia- 
tion was in fact justified, provided z is not on the negative real axis (ref. 31). 

r oo 

Hence, I [e~ t /{l + zt)\dt is an analytic function of z everywhere in the z- 

plane cut along the negative real axis and is therefore infinitely differentiable 
in this region. 

Upon multiplying equation (9-17) by z and equation (9-18) by z 2 and 
then adding the result, we get 


z 2 w' + zw 


=_22 f 


e~ c t 

(1+zf) 2 


dt z 


f°° ez! 

Jo 1 + zt 


dt 


=i; 


. t z( 1 + zf) — zH 

(l + zt ) 2 


dt 


j; 


ezjz 

(l+zt) 2 


dt 


And after integration by parts, this becomes 
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w 

which shows that w satisfies the equation 

z 2 w' + (z + 1 ) w = 1 

provided z is not on the negative real axis. Since w is infinitely differentiable, 
we can certainly differentiate this equation with respect to z. But when this is 
done we obtain equation (9-10). Thus, the function w given by equation 
(9-17) provides a perfectly valid solution to equation (9-10). This solution 
was obtained by formally summing the divergent series (9-15), which was, in 
turn, obtained by formally solving equation (9-10). It is, therefore, natural to 
ask in which sense the divergent series (9-15) represents the integral (9-17) 
and, therefore, as a consequence, in what sense it represents a solution to 
equation (9-10). To answer this question, let 


x r oc 

z 2 w' + zw = — ~ I — dt= 1 — 

1 + zt 0 Jo 1 + zf 


m 

Sm{z)= £ (-1 )"/»!*» (9-19) 

n - 0 

be the mth partial sum of the series (9-15). Now from elementary algebra, we 
know that the sum of the finite geometric series is 



n - 0 


J — r m + 1 


1 — r 


which becomes, upon setting r— — zt , 


1 

1 + zt 


( — zt) M+1 
1 + zt 


m 

2 (-i) n (zt) n 

n- o 


But, upon multiplying both sides by e -t and integrating between 0 and we 
find that 


p -t m 

— — dt= Y(— 1 ) n (zt) n e~ t dt+Rm(z) 

Jo 1 + zt Jo n “ 0 
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where we have put 

f 00 f tn -f- 1 p -t 

Rm(z) = (-z)”+iJ o (9-20) 

And since it is always legitimate to interchange the order of integration and 
summation for a finite sum, this becomes 

f°° p -t m r x 

I T— — dt — V (— l) n z" I t n e~ t dt = R m (z) 

Jo i+zt "<, Jo 

Hence, substituting in equations (9-16) and (9-19) shows that 

r T+X dt ~ Sm(z) = (9_21) 

Now since 


|1 + zt\ = |1 + xt + iyt\ = V(1 + xt) 2 + y 2 ? 


it follows that 

|1 + zt\ V(T+lcFp = |1 + xt\ ^ 1 foi z = x > 0 


And for £z = r^0we find that 

|1 + zt\ V ( xt) 2 + (ytj 2 = { (1 + s£) 2 (*0 2 + ^^[(l + xt) 2 + (xt) 2 ] + ( yt ) 4 } 1/2 

25 Vy2« 2 [(l + **) 2 + (xO 2 ] 5 \yt\ 


Hence 


|l + zf| 


_i \yt] 

V2 V(*t) 2 +(yt) 2 


1 |y| _ 1 

V 2 + V2 


sind>| 


for z=x< 0 
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where (f> is the argument of z. We have therefore shown that 

JT+ltf S 1 for ^ 230 (9-22) 

and 

TTT^ ^ |cosec 4>\ for z ^ 0; |<^| < 7r (9-23) 

It now follows from equations (9-20) and (9-22) that when M- z 3* 0 

r 00 I f m+\ p -t f x 

^ W m+1 T3TT dt ^ \ z \ m+1 t m+l e~*dt 
Jo | 1 + zt Jo 

And equations (9-16) and (9-21) therefore show that 

ki(z)-Sm(z)| < |z| m + 1 (m+l) ! for^ 2>0 (9-24) 

where 

r x -t 

Wl(z) ^J 0 TFFt dt (9-25) 

is the solution to equation (9—10). In a similar way, it follows from equation 
(9-23) that 

|m>i(z) — Sm(z ) I V2| cosec <£ | |z| m + 1 (m+ 1) ! for^ z< 0; \<f>\ < 7 r 

(9-26) 
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Thus, in view of equations (9 — 15) and (9-19), the inequalities (9-24) 
and (9-26) show that, provided <f), the argument of z, is smaller in absolute 
value than 7 r, the error incurred by computing the solution wi{z) by means of 
the divergent series (9-15) goes to zero rapidly as z -* 0. Thus, for any fixed m , 
any desired accuracy can be obtained by taking |z| sufficiently near zero. In 
particular, the inequality (9—24) shows that, provided z > 0, the error is 
never greater in magnitude than the first term omitted in the series. 

x 

Now recall that a series V a n is said to converge to a sum /I if its partial 

m 

sums a n satisfy the relation 


lim \A — t m \ = 0 

m->o o' 1 

The series (9-15) certainly does not converge to the solution Wi(z ) since 


lim \u>i(z) —Sm(z) | t^O (9-27) 

m — *<» 

But this series is said to be an asymptotic expansion (which will be defined 
subsequently) of wi(z) since, as can be seen from equations (9—24) and 
(9-26) , it has the property that 


J m(z) = 0 for \<f>\ ^ tt+6 (9-28) 


for any € > 0. 

9.4 ASYMPTOTIC EXPANSIONS 

Roughly speaking, equation (9-28) shows that the difference between 
the solution wi(z ) and the mth partial sum Sm(z) of the series (9-15) approaches 
zero more rapidly than z m as 0. Comparing the definition of convergence 
with the corresponding condition (9-28) for a series to be asymptotic, shows 
that the essential difference between these two concepts can be stated in the 
following way: If the series is convergent to u>i (z) , then z is held fixed and the 
difference Wi(z) — S m (z) approaches zero as m approaches infinity. If the 
series is asymptotic to w\ (z) , then the number m of terms is held fixed and the 
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difference Wi{z)— S m (z) approaches zero (at a specified rate) as z approaches 
some fixed value, say z<>. 

There is no reason why a given series cannot be both asymptotic and con- 

00 

vergent at the same time. For example, the geometric series ^ z " is convergent 

n-0 

to 1/(1 — z) for \z\ < 1 and it is also an asymptotic expansion of this function 
as z —■ * 0. 

When a series is convergent as well as asymptotic, the approximation 
can usually be improved at fixed z by taking more and more terms in the par- 
tial sum. However, it happens more frequently that an asymptotic series is 
divergent. 96 When this is the case, there is some limiting value to the accuracy 
of the approximation which can be obtained at each fixed value of z. This limit- 
ing value is attained when the optimal number of terms is retained in the expan- 
sion. Thus, for the expansion (9-15) with z > 0, the right side of the inequality 
(9-24) is a minimum when m \z\ _1 . For this value of m, S m (z) will provide 
the best approximation to wi (z) . 

The theory of asymptotic expansion was initiated by Stieltjes and by Poin- 
care at the end of the nineteenth century. Before giving the general definition 
of an asymptotic expansion, it is convenient to introduce the concept of order. 
Thus, if (j) and i// are complex functions of z, we write 

<f> = 0(4i) as z — »zo 

if there exists a finite constant A such that 


lim 

2 * Zo 


<£00 

0(z) 


<A 


And we write 


if 


= asz— »zo 


lilT1 <£00 

z-**ol//(z) 


0 


96 Usually in this case the terms of the series first decrease rapidly (the more rapidly the closer the independent 
variable is to its limiting value) but eventually start to increase again. 
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The symbols O and o are called order symbols, and in both cases we say that 
<f> is of order (//. Thus, 

z 4 = 0(z 5 ) as z~> 00 

z 4 = o(z 5 ) as z~- * 00 

z 4 — 0(4z 4 + 3z 3 ) asz— »°° 

but 

z 4 ^ o(4z 4 + 3z 3 ) as z~* 00 

Also 

z 5 = o(z 4 ) asz— »0 

but 

z 4 # oCz 5 ) as z— » 0 

and, for z > 0, 

e _z = o(z a ) as — * 00 

for any constant a. 

If f(z) is an analytic function which has a pole of order m at zo, then 
/( z ) = O((z-z 0 )- m ) and f(z) # o( (z-zo)~ m ) asz-»z 0 

It is common practice to denote any sequence, say Oi, a%, as, . . by 
{cij}. A sequence of functions {<fo(z)} is said to be an asymptotic sequence as 
z — ► zo if for each integer j, (f>j+i = o as z~ * zo. Thus, { (z — zo) J } is an asymp- 
totic sequence as z~> z 0 ; {z^} is as asymptotic sequence as z -* and {e~J*} is 
an asymptotic sequence as z— » 00 , provided z> 0. 
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Note that equation (9-28) can be written in terms of the order symbol o as 

u)i —s m =o(z m ) asz— ► 0 

We are, therefore, led to the following definition: Let be an asymptotic 

sequence as z — * zo. Then the formal series 

2 a i < f>j( z ) (9—29) 

j=0 

with partial sums S m (z)= \aj<f>j(z ) is called an asymptotic series . The 

j = o 

asymptotic series (9-29) is said to be an asymptotic expansion to m terms 
of a function f(z) if 

f(z) — S m (z) = o(<f>m) as z — ^ zo (9-30) 

and we write 

m 

f(z) ~ 2 

j=o 

If equation (9-30) holds for every positive integer m, we simply say that 
equation (9-29) is an asymptotic expansion of f(z ) and we write 


f(z)~ 2 a j<t>j(z) 

j=0 


Thus, in the preceding example, we have shown that 



e~ l dt 
l + z£ 


00 

~ S (-D 

«=n 


n n\z n 


This series is typical in that most asymptotic series encc intered in practice 

involve powers or inverse powers of z . Such series are called asymptotic power 
series . 

Once a particular asymptotic sequence has been specified, any given 
function will have only one asymptotic expansion in terms of this sequence. 
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In this sense the asymptotic expansion is unique. However, two different 
functions may have the same asymptotic expansion. Thus, if 


f(z ) ~ dnZ " as z -* 00 

n=0' 

it is not hard to verify that 

f(z) + e-*~ jg anz~ n as z^°° 

n- 0 


for z> 0. 

It can be shown that if f(z) ^ a n (f)n{z) and g(z) ~ ^ bn<f>n(z ) , then 
af(z)+(3g(z) (aa„ + pb n )<l>n(z) 

for any complex constants a and /3. It is generally possible to integrate asymp- 
totic expansions term by term, but differentiation is not always permissible 
(ref. 32) . 

Asymptotic power series can be manipulated in much the same way as 
convergent power series. Let the function f(z) be a single-valued analytic 
function at every finite point z with |z| > R. If/"(z) has the asymptotic expansion 


f{z)-y j a n z~ n asz— *oo 

n= 0 


and this expansion is valid for all values of the argument of z, then f(z) must 
be bounded as z-*oo. Hence, it cannot possess a pole or an essential singu- 
larity at z=oo. Therefore, an analytic function cannot possess a single asymptotic 
expansion in inverse powers of z as z -» oo which is valid for all values of the 
argument of z unless it is analytic at infinity. Thus, asymptotic power series 
are frequently valid only over a given sector of the complex plane, and a 
different asymptotic representation must be used outside this sector. This 
is closely related to the so-called Stokes phenomenon which we shall encounter 
subsequently. 

If a function /(z) is analytic at a point z 0 and, therefore, possesses a con- 
vergent power series about zo, it is not hard to see that the series must also 
be an asymptotic expansion of /(z) as z — * Zo (ref. 32, p. 22). 


266 


ASYMPTOTIC EXPANSIONS 


9.5 NORMAL SOLUTIONS 

Let z o be an irregular singular point of the equation 

w"+p(z)w' + q(z)w=0 (9-31) 

and suppose that, if zo is a finite point, 

p = 0((z — zo)~ m ) and q=0((z — Zo) - *** 0 ) asz^z 0 

and that, if Zo =OCl , 

p — 0{z m ) and q = 0(z (fc_1) ) as z— * 

where m and k are integers. 

We have seen that, when k < m (and only in this case), it is always possible 
to construct a formaf series solution w\ (z) of the form 


Wi{z) = (z — z 0 ) p ^ a n (z — Zo)" ifzoisfinite 


(9-32a) 


or of the form 


Wi(z) = z py £ anZ~ n if z 0 = 00 (9-32b) 

n=0 

which satisfies the differential equation term by term. The coefficients of these 
expansions can be calculated recursively from the differential equation in 
much the same manner as those in the series solutions at regular singular 
points. These expansions will, in general, be divergent. However, as we have 
seen by example they actually turn out to be asymptotic expansions of a true 
solution to equation (9-31) at the point z=z 0 . They will, therefore, be very 
useful for numerical computation 97 of this solution forz nearzo. 

The series in equations (9-32a) and (9-32b) will not diverge only if they 
terminate after a finite number of terms, in which case these equations will 
represent regular solutions to the differential equation (9-31). A second solu- 


7 In fact, asymptotic expansions are quite frequently more suitable for numerical computation than convergent 
power series since fewer terms are required to obtain a given accuracy. 
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tion at the point zo can then always be found from equation (6—63). Thus, except 
in the relatively rare case where k «£ m and the formal power series solution 
terminates, it is still necessary to find one or two solutions to equation (9-31) 
or at least the asymptotic expansions of these solutions about the irregular 
singular point z=zo. To this end, we introduce into equation (9-31) the change 
of dependent variable 


w=e Cl u 


(9-33) 


where 



ifzo is finite 


if zo = 00 


(9-34) 


and the coefficients o) n for n=l, 2, . . ., s are complex constants. Then 
equation (9-31) is transformed into the equation 


u”+p + (z)u' + q + {z)u = 0 (9-35) 

where 

p + (z) =p(z) + 2(V (9-36) 

and 

q+(z)=g(z)+p(z)n , +a"+n , 2 ( 9 _ 37 ) 

Now if, when zo is finite, the terms in fl(z) can be chosen to cancel out 
enough of the negative powers of z — zo which arise from p(z) and q(z) so that 
p + (z) has, at most, a simple pole and q + (z ) has, at most, a pole of order two at 
zo, then zo will be a regular singular point of equation (9-35). Similar remarks, 
of course, apply when z o =0 °. In either case, then, when this can be done, a 
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fundamental set of solutions can always be found by the methods of chapter 6; 
and equation (9-33) will then provide a fundamental set of solutions of equa- 
tion (9—31). But this usually cannot be done. More frequently, it is only pos- 
sible to adjust the coefficients in ft so that the resulting equation (9—35) will 
possess a formal solution either of the form (9-32 a) (whenzo is finite) or of the 
form (9-32b) (when zo is infinite). Then equation (9-31) will have a formal solu- 
tion either of the form 


w(z) = e n(z) (z — z 0 ) p 2) an(z—z 0 ) n 

n-0 

if Zo is finite 

(9-38 a) 

or of the form 



w(z) = e n(z) z p 2 a„z~ n 

n=0 

if Zo = 00 

(9- 38b) 


These formal solutions, called normal solutions , either will terminate or they 
will diverge in which case they will be asymptotic expansions. 

Before discussing this in more detail, it is convenient to introduce the 
following definition: If the coefficients p(z ) and q(z) of the differential equation 


w"+p(z)w' + q(z)w= 0 


(9-39) 


have, at most, poles at zo, the equation is said to be of finite rank at z 0 . If zo is 
a finite point and equation (9—39) is ol finite rank at Zo, the smallest number r 
such that both 


P = 0((z — zo)~ (r+1) ) and <7=0((z-zo)- 2 < r+1 >) asz^zo 


is called the rank of equation (9—39) at zo. The rank of equation (9—39) at 
z= °° is defined to be the smallest number r such that 

p = 0(z r ~ 1 ) and q = 0(z 2ir ~ 1) ) asz-»°° 

provided the equation is of finite rank at °°. If the equation has a regular 
singular point at zo, it is said to be of rank zero. 
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For example, since the coefficients of the equation 


zw"+w' + w=0 (9-40) 

satisfy the conditions 

p = 0(z-‘) q=0(z~') 1 

> as z —■ *• 00 

p 7 £ o(z -1 ) q 7 ^ o(z -1 ) J 

we see that this equation is of rank 1/2 at z=°°. 

Now suppose that equation (9-31) is of rank r > 0 at the point zo. In order 
that equation (9—35) possess a formal solution of the form (9-32a) when z o is a 
finite point, it is necessary that the highest negative power of z — zoin q + exceed 
the highest negative power in p + by no more than 1. And if equation (9-35) is 
to possess a formal solution of the form (9-32b) when Zo =0 °, it is necessary 
that the highest power of z in q + be less than the highest power of z in p+. 
Suppose that zo=°°. An examination of the expression obtained by substituting 
equation (9—34) into equations (9—36) and (9—37) shows that q + will always 
contain higher powers of z than p + unless the numbers a > n and s in equation 
(9—34) can be chosen to cancel out the coefficients of those powers of z in q + 
which are larger than, or equal to, those in p + . This can be done only if the rank 
r of equation (9-31) is an integer. And the maximum value of s necessary to 
accomplish this will then always be less than, or equal to, r. The same con- 
clusion applies when zo is finite. It is usually possible to choose the numbers 
(On and s so that the cancellation is accomplished in more than one way. In 
this case, we obtain two solutions to equation (9-31). 

Thus, equation (9-31) will possess formal solutions either of the form 
( 9~38a ) or the form ( 9-38b ) at the point zo only if the rank r of this equation at 
zo is an integer. The function O(z) is then given by equation ( 9~34 ) with s =Sr. 

Since equation (9-40) is of rank 1/2 at z=°°, it does not possess a normal 
solution at this point. 

As an example of an equation which does possess such a solution at z=°°, 
consider the equation 


w" + q(z)w = 0 


(9-41) 
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with 


q{z)='£q n z~ n (9-42) 

n = 0 

and 

qo # 0 (9-43) 

We see in view of condition (9-43) that equation (9-41) is of rank 1 at z = °°. 
Hence, we make the change of dependent variable 

w — e^u (9-44) 


where, since 5 ^ r= 1, 


Cl = <oiZ (9-45) 

And upon substituting equations (9-44) and (9-45) into equation (9-41) we 
find that 


u" + p + (z)u' + q + (z)u = 0 


(9-46) 


where p + (z) — 2<Wi and q + (z) = (o^ + qo+ qnZ n . The highest power of z 

n= l 

which occurs in both p + and q + is zero. Hence, the highest power of z which 
occurs in q + can be made less than the highest power which occurs in p + by 
choosing oil to be a root of the equation 


<ai + 9o = 0 

But the two roots of this equation are coi,i = iq}/ 2 and a>i )2 =— iq}/ 2 . And each of 
these roots leads to a different form of equation (9—46). Thus, we obtain the 
two equations 
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u" + 2iql l2 u\ + q(z)ui = 0 (9-46a) 

u" 2 — 2iq 1 J 2 u' 2 + q(z)u 2 — 0 (9-46b) 

oo 

where q(z) = 2 QnZ~ n . But we know that these equations possess formal 

,n~ 1 

solutions of the form 


ui = zPi V anz~ n (9-47a) 

n = 0 

and 

u 2 =z p 2 J (9-47b) 

n = 0 

where pi and p 2 are each solutions of linear indicial equations and the coeffi- 
cients a n and b n can be calculated successively from a recurrence relation of 
the usual type. 

Thus, for example, upon substituting equation (9-47a) into equation 
(9-46a), shifting the indices, and collecting coefficients of like powers of 
z in the usual way, we obtain 


2 (pi — n + 2) (pi — n + l)a„_ 2 Z _n 

n = 2 



n 

2iq 1 J 2 (p i — n + l)a„_i + ^ q k a n -k 


= 0 


n = 1 


k= 1 


And after equating to zero the coefficients of like powers of z, we find that for 
n= 1 (since ao ^ 0) 


2iqo /2 pi + qi = 0 


(9-48) 
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and 


[2iqb l2 (pi— n+ 1) + q 1 ]a n -i = — (pi~ n + 2) (pi — n+l)a n -2 


n 

— 2 Qkdn-k for n — 2, 3, . . . 

k = 2 


(9-49) 


Equation (9-48) is the usual linear indicial equation. And substituting this 
into the recurrence relation (9-49) shows that 

-8iq* l2 (n-l)a n -i = [qi + 2iqj ) l2 (n-2)] [q x + 2iq^ (n- l)[a n -> 

n 

— 4 qo ^ qi&n-k forn = 2,3, . . . (9-49a) 

This recurrence relation will either terminate or else it will determine the 
a n so that the series (9— 47a) diverges. 

In any event, the two formal solutions (9— 47a) and (9— 47b) will now 
provide, through equations (9-44) and (9-45), two normal solutions to 
equation (9—41), say w\ and w 2 , which are given by 


Wi = e iq °'z z p i 2) a n z~ n 

n=0 


w 2 =e <‘z z p 2 21 b n z~ n 


n=0 


(9-50) 


Note that if condition (9—43) did not hold (i.e., if qo = 0) but the condition 
qi ^ 0 did hold, then equation (9-41) would be of rank 1/2 at z= 00 . In this 
case, it would not be possible to carry out the procedure just described. 
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9.6 SUBNORMAL SOLUTIONS 

We have seen that if equation (9-31) is of rank r with r > 0 at the point 
zo, it will have a formal solution of the form (9-38a) or (9-38b) only if r is an 
integer. However, a change of independent variable of the type 


[ (z — Zo) llk 



if Zo is finite 
if zo = 00 


where A: is a positive integer, will usually transform an equation of fractional 
rank into one .whose rank is an integer. The formal solutions obtained by this 
procedure are called subnormal solutions. 

Thus, we have shown that equation (9—41) in the preceding example has 
rank 1/2 at <» when condition (9-43) does not hold but the condition q\ ¥=■ 0 does 
hold. That is, the equation 


w"+ q(z)w= 0 


(9-51) 


where 


Q(z) = 2 QnZ' 


n= 1 


(9-52) 


and 


q i^0 (9-53) 

has rank 1/2 at z=°° and, therefore, does not possess a normal solution at this 
point. However, upon making the change of variable £ = z 112 , equation (9-51) 
becomes 


d^w 
d X 2 


|^ + (?(£)m ; — 0 


(9-54) 
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where 


<?(£) = 4£ c- 2(n - 1) qn (9-55) 

n=l 


And since this equation is of rank 1 at £=°°, we introduce the change of 
variable 


w = e“i 


(9-56) 


to obtain the equation 


% +PM) fi +Q ' (Ou=0 


(9-57) 


where 


P i({) t + 2o>i 


<?.«)=-T+“i+49, + 4X,„r !( "^ 

t> n = 2 


Then by putting 


a)f + 4q 1 = 0 


( 9 - 58 ) 


we find that the highest power of £ which occurs in is less than the highest 
power which occurs in Pi. And, as in the preceding example, corresponding 
to the two roots ±i2q\ 12 of equation (9-58) we obtain the following two equa- 
tions from equation (9-56): 


d 2 u i 




u i = 0 
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d 2 u 2 


(i +4<2 )^ +4 [ 


zjSil 

2C 


2 <Zn£ _2(n-1) «2 = 0, 

11 = 9 


But we know that these equations possess the formal solutions Ui = C Pl 2 a n£“" 
00 n= ^ 
and U 2 = C p z 2 whose coefficients can be calculated recursively in the 


«= o 


usual way. And therefore equation (9-51) possesses the two subnormal solutions 


Wl = e aq\l 2 z ll2 z ( Pl i2) y anz~ nl 2 

(9-59) 



tc 2 = e~ i29 l /2 * l/2 Z (p 2 /2) 2 bnZ ~ nl 2 
n=0 

(9-60) 


9.7 nature of rigorous proofs 

The formal solutions obtained in the preceding section are asymptotic 
expansions of certain solutions to the differential equations. One method 
for actually proving this was developed by G. Birkhoff. In this method the 
leading terms of the partial sums of each formal solution are used to con- 
struct a homogeneous differential equation whose solution is known and which 
in a certain sense is close to the given equation when z is near 98 zo. This equation 
is then used to construct a singular integral equation whose solution also satis- 
fies the given differential equation. It is then proved that the solution to this 
integral equation possesses an asymptotic expansion which coincides with 
the formal solution obtained by the methods discussed previously. For a de- 
tailed discussion of this method, in which equation (9-41) is treated to illustrate 
the general principles, the reader is referred to reference 32. 


9.8 CONNECTION OF SOLUTIONS: STOKES PHENOMENON 

In the preceding examples, we have shown how to obtain the asymptotic 
expansions as z —*■ °° of two solutions to a differential equation of the type 


98 Without loss of generality, the point z o (about which the solutions are obtained) is taken as °°. 
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(9-41). The problem frequently arises of connecting these asymptotic expan- 
sions with an expression 99 for some given solution to the differential equation 
which is valid for small values of 2 . We know, in general, that the asymptotic 
expansion of any given solution w to equation (9—41) can be expressed as a 
linear combination of the two asymptotic solutions (9-50). However, it usually 
turns out that the particular linear combination of these two asymptotic solu- 
tions used to represent the asymptotic expansion of the given solution w must 
be changed as the variable 2 crosses certain “critical rays” in going from one 
sector of the complex plane to another. This is the Stokes phenomenon which we 
have mentioned previously. 

For example, the change of variable 


JF=z~ ll2 w 


(9-61) 


transforms Bessel’s equation 


zW" + zW'+ {z i -p i )W= 0 


(9-62) 


into the equation 


w" + 



w = 0 


(9-63) 


which has the form of equation (9-41) with the coefficients in equation (9-42) 
given by 


go = 1 


q\ = 0 



and 


(9-64) 


q n = 0 for n = 3, 4, 5, . . . 


99 Such as a power series expansion. 
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And if we choose the arbitrary constants ao and bo, respectively, to be 

do = (I)" | e -i[(P"/ 2 )+(tt/ 4 )] 'j 

-r i e !'[(pir/2)+(7r/4)] 
it is easy to show that the formal solutions (9-50) become in this case 


/9 \ 1/2 1 °° 

Wi = ( — I 7 - e ![ 2 -(p 7 r/ 2 )-( 77 / 4 )] V a„(- i)”z" 
W 2 n =0 


(9-65) 


^2 


where 


/2 \ I/ 2 1 00 

= ( — j 2 e _i t 2_<p,r/2)_(,r/4)] 2 «n(i)"2" 

' ' 71—0 


(HM 


(9-66) 

(9-67) 


&n 


2 n n\ 


Hence, the asymptotic expansion of any given solution W of equation 
(9-62) can be expressed as a linear combination 


W(z) ~ Ciz - 1 / 2 Wi(z) + Czz~ iI2 w<l(z) 


of the functions (9-66) and (9-67) multiplied by z -1/2 . Let us choose the solution 
W(z) to be the Bessel function of the first kind J P {z). Then equation (8-37) 
shows that for — n < arg z < tt, Ci = C 2 = 1 and 

Jp(z) ~ z~ il2 wi{z) + z~ ll2 w 2 (z) — 7r<argz< 77 (9-68) 

This representation does not apply along the critical ray arg z = tt, that is, 
along the negative real axis. To find an asymptotic expansion which is valid 
in a region which includes the negative real axis, put 
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z = e ni Zi (9-69) 

Then since arg z = tt + arg zi, arg z varies between 0 and 2tt as arg z\ varies 
between — n and it. And it follows from the definition (8-27) of the Bessel 
function that 


Jp(z) =j p (e ni zi) = e npi J p (z i ) for |argz,j < it 

But arg z\ is in the range where J p has the asymptotic representation (9-68). 
Hence, 


J p {z) ~ e nip Zi ll2 wi(zi) + e vip z^ ll2 w 2 (zi) for |arg zi\ < tt 
And since equations (9-66) and (9-67) show that 

Wi(z \ ) = e~^ pir+ ^ l2 ^w 2 (z) w 2 (zi) = e'^+W^Wiiz) 


it follows that 

J p (z) ~ z~ ll2 Wi{z) —e 2nip z~ ll2 Wi(z) for 0 < arg z < 277 (9-70) 

Comparing equations (9-68) and (9-70) shows that different linear com- 
binations of w>i(z) and w 2 (z) must be used to represent the asymptotic expan- 
sion of J P (z) in the sectors — tt < arg z < it and 0 < arg z < 277. And, equa- 
tions (9—66), (9—67), and (9—70) show that 

J p (z) ~ e -ip+(3^/2) (JL^ 1 sin ^ z+ EE- 

to one term, for 0 < arg z < 277 (9—71) 

Since in each region of the complex plane one of the functions, wi(z) and 
w 2 (z), will be exponentially small compared with the other, the expansions 
(9-69) and (9-70) will be equal to one another, with exponentially small error, 
in every region where they are both defined. 
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Expansions in Small and Large Parameters: 
Singular Perturbations 


Many of the differential equations encountered in practice contain param- 
eters. Although it frequently happens that we cannot find the exact solutions 
to these equations, the behavior of these solutions for large or small values of 
the parameters is often physically important. In such cases we are content to 
find asymptotic representations of the solutions which are valid for these 
limiting values of the parameters. A number of techniques for obtaining such 
expansions are given in this chapter. Since no formal theory has been developed 
for many of these techniques, the approach of this chapter is necessarily 
heuristic and the material is frequently presented by means of examples. 
On the other hand, the ideas developed herein are quite general; and they 
apply not only to ordinary differential equations, but also to partial differential 
equations, integral equations, and even difference equations. 

10.1 NONSINGULAR EXPANSIONS OF SOLUTIONS 

The general second-order homogeneous linear equation containing a 
large parameter X is of the form 


fs + p( *’ + x)y=0 


( 10 - 1 ) 


We suppose that p(x, X) and q(x, X) have (at least formal) power series ex- 
pansions in X. If these expansions contain only nonpositive powers of X, say 
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p(x, X) = j? Pn(x)\~ n (10-2) 

71=0 

g(*,X)=2 ^(x)k~ n (10-3) 

n - 0 

then the solutions to equation (10-1) will have a formal power series expansion 
of the form 


co 

y(x) = $ y„(x)\- n 

71=0 


(10-4) 


where the functions y n (x) are determined by substituting the expansion 
(10-4) into equation (10-1) and equating to zero the coefficients of like powers 
of X to obtain 

Po(x) 4o( x )y n = — 2 Pn-k(x) -j^ + g n -k(x)yk j 

for n = 0, 1, 2, . . . (10-5) 


and the sum on the right is omitted for n — 0. These equations can, at least in 
principle, be solved successively for the coefficients y n . If the series (10-2) 
and (10-3) converge, it can be shown (ref. 4, p. 126, ex. 6) that the series (10-4) 
will also converge. In any case, the formal series (10-4) will usually be an 
asymptotic expansion. 

However, if the formal power series expansion for either p(x, X) or q(x , X) 
involves any positive powers of X, the expansion of the solution in powers of 
X will, in general, involve infinitely many positive and negative powers of X; 
and it will not be possible to solve for the coefficients successively. Neverthe- 
less, it is still possible in certain instances to obtain formal solutions whose 
terms can be calculated successively by using a technique analogous to the 
procedure used in the preceding chapter to obtain solutions at irregular singular 
points. These solutions will not, in general, be convergent series; but they 
will be asymptotic expansions. Thus, consider the equation 

0+pU, X) ^+q(x, X)y= 0 (10-6) 
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where 


p(x, X) = J Pn(x)k k - 


n=0 


(10-7) 


00 


q(x, x)= 2 qn(x)\ 2k n 

n = 0 


( 10 - 8 ) 


A: is a positive integer, and po(*) and qo(*) are not both identically zero. Then 
it can be shown by direct substitution that equation (10-6) will possess two 
formal solutions of the form 


y=e (Hx) £ y n (*)A-” (10-9) 

n=0 

where 

f l(x ) = O >n(x)\ k ~ n (10-10) 

n = 0 

The coefficents y n and o>„ can be calculated by solving successively the set 
of ordinary differential equations which is obtained by substituting the assumed 
solution (10-9) into equation (10-6) and equating to zero the coefficients of 
like powers of X. 

In order to illustrate these ideas, consider the differential equation 

j^X 2 q 0 U) + ?2U)j y = 0 (10-11) 

which was first discussed by Liouville in his classical investigations of the 
Sturm-Liouville problems. 

Since in this case Ar = 1, the differential equation will possess formal 
solutions of the form 


y=e"o(*>A ^ y„(x)\~ n (10-12) 

n = 0 
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After substituting this into equation (10-11) and collecting terms, _we find that 

(y'n+ 92 y«)^“ n + X (2w^+ Woyn)X 1_ " + 2 (<7o + (Oo 2 )y„X. 2 ~ n = 0 

n= 0 n=0 n=0 

But upon shifting indices, this becomes 
^ (y»-2+g 2 yn-2)A 2 " n + ^ (2 (o^h-i + fc>oyn-i) ^ 2_n 

n=2 n=l 

+ 2 (Qo + o>; 2 )y„\ 2 -" = 0 

n = 0 

And by equating the coefficients of the like powers of A to zero, we get 

(go + (Oo 2 )y 0 = 0 for n = 0 (10-13) 

Zto'oy'o + &>oyo + (go + a>o 2 )yi = 0 forn=l (10-14) 

and 

y »-2 + q&n-z + 2ft>oy^_ t + (DoYn-i 4- (qo + c«Jo 2 )y n = 0 for n = 2, 3, . . . 

(10-15) 

In order to avoid the trivial solution y=0, we must require that yo^O. 
Hence, equation (10-13) becomes 

qo + o>; 2 = 0 (10-16) 

When this is used in equations (10-14) and (10-15), we obtain 


2<*^yo+ ft> oyo — 0 


2 &>oy«-i + ft>o'yn-i = - y'n-2 - qzyn-2 


for n = 2, 3, 


(10-17) 
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In the case of the differential equation (10-1) we had to solve the set of 
second-order differential equations (10-5) to determine the coefficients y„; 
whereas, in this case it is only necessary to solve a set of first-order equations. 100 
Since qo ^ 0 equation (10-16) will have two distinct solutions, say a>o , i and 
given by 


o»o, i — h J V= 


o>o, 2 — J 


V— qo dx 

(10-18) 

V— qo dx 

(10-19) 


Corresponding to each of these roots, we will obtain a different set of 
equations from equations (10-17) and, therefore, equation (10-12) will yield 
two different formal solutions to equation (10-11). It is proved in reference 
32 (p. 83), that these two formal solutions are actually asymptotic expansions 
of two linearly independent solutions to equation (10-11) in any finite interval 
a < x < b in which qo does not take on the value zero. The first equation 
(10-17) will yield the same equation for both of the roots (10-18) and (10-19). 
Thus, when equation (10-16) is substituted into the first equation (10-17), 
we obtain the separable equation 





which is easily solved to obtain 

yo— constant X q^* (10-20) 

Equations (10-18) to (10-20) can now be substituted into equation (10-12) 
to show that the general solution of equation (10-11) in any interval a < x< b 
where qo(x) 0 has the asymptotic expansion 

y ~ CiqQ ll4 e x l' / - q ’ Jx +C 2 qo ll4 e~^' /= * dx to one term (10-21) 


100 This situation is analogous to the reduction of the indicial equation from a quadratic to a linear equation in 
going from a regular singular point to an irregular singular point. 
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where Ci and C 2 are arbitrary constants. Thus, <?o(*) must be either strictly 
positive or else strictly negative for all a < x < b. If qo is strictly negative, the 
constants of equation (10-21) can be redefined slightly to obtain 

y ~ C 3 (~q 0 ) + C 4 (- qo)- 1 '^-^ ^ 

to one term, for q<*(x) < 0 (10-22) 

And if qo(x) is strictly positive for all a < x < b, the constants can be redefined 
to obtain 

y ~ Ciq ^ 1/4 sin ^A J Vqodxj + C 2 q $ 1/4 cos ^A J Vqodx^j 

to one term, for qo(x) > 0 (10-23) 

Notice that the asymptotic solution (10-22) has a monotonic behavior, 
while the asymptotic solution (10-23) has an oscillatory behavior. 

10.2 TRANSITION POINTS 

Now let us consider the case where qo(x) is equal to zero at a single point, 
say xo, in the interval a<x<b. And suppose, in addition, that qo(x) and 
q 2 (x) are analytic at jco- Then qo(x) and q 2 (x) must have the expansions 

qo(x) = qQ,i{x — xo) + qo, 2 {x — xo) 2 + . . . (10-24) 

q 2 (x) = q 2 ,o+q 2 ,i(x — x 0 ) + . . . (10-25) 

about the point a: = JCo, and we shall require for definiteness that qo,i # 0. 

Thus, the point xo is an ordinary point of the differential equation (10-11). 
Every solution to this equation must, therefore, be analytic at *o. But in view 
of equation (10—24), equation (10-21) implies that 

y=0((x — x o) -1/4 ) as x —> xo (10-26) 

Hence, the asymptotic expansion (10-21) cannot represent an analytic func- 
tion in a neighborhood of xo and, therefore, it certainly cannot be the asymp- 
totic expansion of any solution to equation (10-11) in the neighborhood of 
this point. 
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However, for any positive number 8 (no matter how small), equation 
(10-21) provides a valid asymptotic expansion, to one term, of the general 
solution to equation (10-11) both in the interval a < x < x o — 8 and in the 
interval xo + 8 < x < b because qo(x) does not vanish in either of these 
intervals. Since equation (10-24) shows that qo(x) is not tangent to the 
jc-axis at xo , it must cross the axis at this point and must, therefore, be positive 
in one of the intervals. The solution to equation (10—11) will then have the 
asymptotic expansion (10-23) in this interval. In the other interval, qo(x) will 
be negative and equation (10-11) will have the asymptotic solution (10-22). 
Thus, the asymptotic solution to equation (10-11) will have an oscillatory 
behavior on one side of Xo and will have a monotonic behavior on the other side. 
The transition from one type of behavior to the other takes place in a small 
region centered at xo. The point xo is, therefore, called a transition point. 101 

When the second equation (10-17) with n — 2 is solved for yi(ac) and 
equations (10-16) and (10-20) are substituted into the result, it is found 
from equations (10-24) and (10-25) that 


yi=0((x— xo)~ 7li ) asx-*xo 

But equation (10-26) shows that yo is of order (x — *o)~ ,/4 as x—> xo. Hence, 
the second term in the expansion (10-12) will be approximately equal to the 
first term when 


| (x — JCo) - 1/4 1 « X ^(a; — Xo) 7/4 | 

that is, when the distance between x and xo is approximately \x — xo\ ~ A~ 2/3 . 

Thus, when the distance between x and Xo is less than or equal to A -2 / 3 , the 
order of the terms of the expansion (10-12) will actually increase with increas- 
ing n. This shows that the asymptotic expansion breaks down in a region of 
radius A~ 2/3 about the point xo . It is, therefore, said to be a nonuniformly valid 
asymptotic expansion. 

In order to obtain an asymptotic expansion which is valid in this region, we 
rescale the independent variable so that it will be of order 1 therein. That is, 
we introduce the new independent variable 

{ =(*-*«) A 2/3 (10-27) 


101 The term turning point is also used. This term arises from the application of eq. (10-11) to classical and quan- 
tum mechanical wave reflection problems. 
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into equations (10-11), (10-24), and (10-25) to get 


d 2 y . 

[ X2/3<7o 

(JL 

'll 1 * 

(l ^«)] y=0 

(10-28) 

d? 

V X 2 / 3 

j+ X 4/3 92 

Qo ( 

K X 2 / 3 ) 

= Qo,i 

1 . 

X 2/3 ' 90.2 

CO 

(10-29) 


92 ( ^j = Q2,o+q2,i-^ + • • • ( 10 - 30 ) 

Then upon substituting equations (10—29) and (10—30) into equation (10—28) 
and neglecting terms which are small for large values of X, we get 


^5 + 9o,i£y=0 (10-31) 

The solution to this equation should be “close” to the true solution to equation 
(10-11) at least in the region 


\x-xo\ <X" 2/3 (10-32) 

or |£| < 1. Hence, it should represent the first term of the asymptotic expan- 
sion of the solution to equation (10-11) in the region (10-32). 

The differential equation (10-31) is a form of Airy’s equation (ref. 29, 
section 6.4) which can be transformed by the change of variable 

y=VgY £=| 

into the Bessel’s equation 


288 


SINGULAR PERTURBATIONS 


♦ Ms )’] 1 '- 0 

(see eq. (8-24)). Thus, the general solution to equation (10—31) is 

y= ( | , ( | f s « ) 

+ ( f X ,,J )' B D z ('i‘J-,n ( f V^7.e" 2 ) (10-33) 

where D\ and D 2 are arbitrary constants and the normalization factor (7 tX 1/3 /3) 1/2 
has been inserted for convenience. 

Let us now suppose, for definiteness, that 

qo, I > o (10-34) 

Then solution (10-22) will hold in the region a <x < x o — 8 and the solution 
(10-23) will hold in the region xo + 8 < x < b, where 8 is some arbitrarily 
small positive constant. The solution (10-33) holds in some region which is 
centered at xo and has a size at least of order A -2/3 . Having found the pieces 
of the solution which apply in the three regions into which the interval a< x< b 
has been divided, it is now necessary to match up these pieces across the ad- 
jacent regions to form one continuous solution. These pieces of the solution 
contain altogether six arbitrary constants. Four of these will be determined by 
the matching requirement. The other two must remain arbitrary if we are 
to obtain an asymptotic expansion of the general solution to equation (10-11). 
In order to accomplish this matching, we suppose that the regions of validity 
of the various pieces of the solution can be extended in such a way that any 
two adjacent regions overlap one another. Thus, we assume there is an “ over- 
lap domain ” (or intermediate region) in which both the expansions (10-22) 
and (10-33) are asymptotic expansions of the solution to equation (10-11) 
and that there is an overlap domain in which both the expansions (10-23) and 
(10-33) are asymptotic expansions of the solutions to equation (10-11) (see 
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Region of applicability of solution (10-33), "inner region" 

^ A X 

"Outer region", region of ap- "Outer region", region of appli- 

plicability of solution (10-22) ca bility of solution <10-23) 

;\\\\\\XXX)6y/ /J /OOOOOOftXXXXXV rt 

Overlap Overlap 

domain domain 


FIGURE 10-1. — Regions for transition-point expansion. 


fig. 10 - 1 ). We therefore require that within the overlap domains the two 
adjacent expansions agree with each other to within an error which is of 
smaller order than the last term retained in these expansions. 

Since the size of the region in which the solution (10-33) applies (which 
we will call the inner region) approaches zero as A— » °°, the location of the 
two overlap regions must also approach Xo as A — > Now in the outer regions, 

the asymptotic expansions correspond to holding the variable x fixed and 
taking the limit A— I n the inner region, the asymptotic expansions corre- 
spond to holding the variable £= (x — xo)A 2/3 fixed and taking the limit A — > 
This limiting process allows the coordinate x in the inner region to move toward 
xo as the size of this region shrinks to zero. Thus, the matching can be per- 
formed in a precise manner by introducing into both the inner and outer ex- 
pansions an “ intermediate variable ” 

7) — (x — x 0 )A“ = £A [a-(2/3) l (10-35) 

with a chosen so that a fixed value of 17 will remain in the overlap region as 
this region shrinks toward xo with increasing A. The matching of the adjacent 
expansions is then accomplished by changing their independent variables 
from x and £ to rj and then requiring that these expansions become identical 
(to the appropriate order in A -1 ) when the limit A— » °° is taken while holding 
y) fixed. However, if 17 is to lie in the overlap regions, we must require that 

2 

0<a<^ (10-36) 
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Now in order to apply the matching procedure to this problem, we intro- 
duce the new variable (10-35) into equation (10-33) to obtain 

y= (f X1 ) 12DiV 1I2J 'I 3 {l T, 3/2 X [(2/3) -“ ]3/ 2j 

+ (| | | r ) 3/2\[(2/3)-a]3/2j (10 _ 37 ) 

First, suppose that 17 > 0. Then equation (8-37) shows that in the limit A— » 00 
with 17 fixed 


y ~ ( q£rn ) ' ' ' " • d . cos { § • - «w» - |§ } 


+ (l^n)'' D ’- cos{!v^^, 


to one term 

(10-38) 


But for 17 < 0 we find that T7 3/2 =e ,(3ir/2) |i7| 3/2 ; and therefore the argument of 
the Bessel’s function in equation (10-37) is no longer in the range where equa- 
tion (8—37) holds. We must, therefore, use equation (9—71). 

In this case, we see that in the limit as A —■ * °o with 17 fixed 

1 / A a \ 1 1 4 (2 , 

y ~2(^7hl) <*-»•>« p{5^l 

1 / 

+ n ( i — r ) (Die~ ,nl6 + D 2 e t1Tl6 ) exp 

2 V 9o,i h|/ 

to one term (10-39) 


Yj | 3/2 ^[2/3— a]3/2 j 

|-| VqZl |^|3/2X[2/3-a]3/ 2 J 
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Next substituting equation (10-35) into equation (10-23) and neglecting 
the higher order terms in X -1 , we find that 



(10-40) 


Hence, the expansion (10-38) and the expansion (10—40) will become identical 
in the right overlap domain if 


Ci = Di sin D 2 sin 


C 2 = D i COSr^+ D 2 COSy^ 


and the expansion (10-23) becomes 

y ~ Diq ^ 1/4 cos ^ X f Vgo dx — \ + D 2 qQ 114 cos f X f Vqo dx I 

\ J Xex 12 / w \ J Xn 12/ 


to one term, for Xo < x < b (10-41) 


In order to match the solutions m the left overlap domain, we must consider 
two cases. First, suppose that D\ ^ D 2 . Then the second term in equation 
(10-39) is negligibly small compared with the first and may be neglected. 
Hence, equation (10-39) becomes 

1 / X“ \ 1/4 (2 , 1 

y ~ ^ j^j" j (^2 _ eX P | 3 ^90, 1 | f) |3/2\[(2/3)-a]3/2 

(10-42) 

This must now match with equation (10-22) in the left overlap domain. It 
can be seen that this matching can occur only if C 3 ^ 0. In this case, the 
second term will be exponentially small, compared with the first; and equation 
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(10-22) becomes, upon substituting in equation (10-35) and neglecting 
higher order terms in 1/A while holding 17 fixed, 


Cs exp {I ^ } 


But this will match with equation (10-42) in the left overlap domain only if 
Cz = (D 2 — Di) 12. And the solution (10-22) therefore becomes 


D 2 — D 1 


(— Qo ) _1/4 exp ^ A J V— go dx ^ 


to one term, 


for a < x < xo and D\ # D 2 


(10-43) 


Next consider the case where D\ = D 2 . Then the first term in equation 
(10-39) is zero and, therefore, the second term cannot be neglected. Equation 
(10-22) will only match onto equation (10-39) in the left overlap domain 
if C 3 = 0 and Ct= ( V3/2)Z)i. Therefore, equation (10-22) becomes 


y 


<7o) -1/4 exp 




to one term, 


for a < x < *0 and D\ = D 2 


(10-44) 


Thus, the complete asymptotic expansion of the general solution to 
equation (10—11) in the interval a<x<b is given by equations (10—33), 
(10—41), and (10—43) or (10—44). The expansions (10—41) and (10—43) or 
(10-44) become poorer and poorer representations of the true solution as 
x—>xo (they are nonuniformly valid expansions), but the expansion (10-33) 
provides a good representation in this region. It is possible to obtain a uniformly 
valid asymptotic expansion in the region xo^x<b by adding the expansion 
(10-41) to the expansion (10-33) and (so that it will not be counted twice) 
subtracting their common value in the transition region given by equation 
(10-38). Thus, 102 


yuniformly valid (18 41)+ eq. (10 33) eq. (10 38) 


102 By construction, eqs. (10-33) and (10-38) are asymptotically equal in the outer region. 
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Similar remarks, of course, apply for the interval a < x xo. Although this 
method is very general and can be used even when a larger number of terms are 
retained in the asymptotic expansion, there is a better method of obtaining a 
uniformly valid one-term asymptotic expansion for the present problem which, 
in fact, applies to the entire interval a < x < b. To this end, notice that the 

term X f Vqodx becomes asymptotically equal to the term 

Jx 0 

|x Vgo.i 

in the transition region. It is, therefore, reasonable to hope that the range of 
validity of equation (10-33) (whi ch applies for small values of 103 x— xo) can be 
extended by replacing 2/3 Vqo.i £ 3/2 in the arguments of the Bessel functions by 

xj. ^qodx and replacing the factor X 1/6 by-go - 1/4 j ^/qodx^j .Upon 

making these substitutions, equation (10-33) becomes 

y~<7o 1/4 (^X Vq^dx'j jz)Ji/ 3 ^xj^ Vq~ 0 dx^j + D 2 J -1/3 ^X J* Vg^dxj 

(10-45) 

Having obtained this result by a heuristic argument, it is now easy to verify 
that, for X 2/3 (x — x 0 ) = O(l), this equation agrees with equation (10-33) to the 
lowest order X -1 and, for x — xo = 0(l) with x > Xo, it agrees to lowest order in 
X -1 with equation (10-41). Similarly, when x— xo=0(l) and x<xo, it agrees 
to lowest order in X -1 with either equation (10—43) or equation (10—45), de- 
pending on whether Di is equal to ZX. Thus, equation (10-45) must represent 
a uniformly valid asymptotic expansion to one term of the general solution to 
equation (10—11). This result was first obtained by Langer (refs. 33 to 35) by a 
more formal procedure. The technique of using inner and outer expansions to 
obtain a uniformly valid expansion began with Friedrichs (ref. 36) in the 1950’s 
and was developed by Kaplan, Lagerstrom, Cole, and many others. This 
technique not only applies to linear and nonlinear ordinary differential equa- 
tions but also to partial differential equations. Many of the applications of this 
method have been to fluid mechanics problems. In fact, the basic ideas grew 
out of boundary layer theory. 


103 This is because f is of order 1 and, therefore, for large X, x must be close to xo ■ 
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10.3 MATCHED ASYMPTOTIC EXPANSIONS 

The method of using inner and outer expansions to obtain a uniformly 
valid expansion can, in fact, be applied to obtain solutions in a systematic 
way to certain equations which are of the form (10-6). The solution (10-9) 
to equation (10-6) was obtained by essentially guessing its general form. 
Therefore, this procedure is limited to equations of the type (10-6); the method 
of matched asymptotic expansions has no such limitation. 

The method is usually applied in such a way that it is necessary to consider 
the boundary conditions along with the differential equation. The ideas in- 
volved are best illustrated by means of an example. However, instead of con- 
sidering an equation containing a large parameter we will now consider an 
equation containing a small parameter 104 e. Thus, we shall seek an asymptotic 
expansion as e —* 0 of the solution to the equation 

€ ltf +P ^ fa + q ^ y= ° (10-46) 

subject to the boundary conditions 

y(0) = 1 (10-47) 

y(l) = a (10-48) 

and where the functions p(x) and q(x) are any functions which can be repre- 

sented by power series, say 


P(x)= p n x n 

n = 0 


Q(x) = 2 


n=0 


(10-49) 


near x=0, and by similar power series, say 


P(x) = X Pn( 1 -*)" g(x) = Y q„(l — *)" 

n^D n=0 


104 Of course, since we can always put A=l/e to obtain an equation containing a large parameter, there is no 
real difference involved. 
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near x— 1. We shall also require that 

p(*)>0 for 0 * =£ 1 (10-50) 

and that the integral [q(x)lp(x)]dx exists. 

Since e is small, it is natural to seek a solution to equation (10-46) in the 
form of a power series in e. Thus, let us try to obtain a formal solution to equa- 
tion (10-46) of the form 


oo 


y(x; e) = 2 yn(x)e n 
n= 0 

(10-51) 

Upon substituting this into equations (10-46) to (10-48) and equating to zero 

the coefficients of like powers of €, we find that 


p (*) o=0 

(10-52) 

p(*) X +q( x )yn~ J X 2 for n- 1,2, . 

(10-53) 

ro(0) = i 

(10-54) 

y o(l) = a 

(10-55) 

y«(0)=y„(l) = 0 for n= 1 , 2, . . . 

• (10-56) 


Since equations (10-52) and (10-53) are first-order linear equations, they 
can be solved immediately to obtain 


yo(x) = Coe -0 ^ 


(10-57) 


y n (x) = e -ft(x) 



gft(x) 


1 d 2 y n - 


p(x) dx 2 


-dx 


fo'rn= 1, 2, . . . 

(10-58) 
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where the C n for n = 0, 1, 2, . . . are arbitrary constants and we have put 



(10-59) 


Notice that since the integral 11(0) exists by hypothesis, the integral (10-59) 
must certainly be finite for all x in the interval 0 x 1. 

Now, except in the very exceptional circumstance where a = e ft(0) , it is 
impossible to choose C 0 so that the zeroth-order solution (10-57) satisfies both 
boundary conditions. Thus, suppose that a ¥=■ and, therefore, that one of 
the boundary conditions (10—54) or (10—55) cannot be satisfied. We reason that, 
just as the expansion (10-12) broke down at the transition point, the expan- 
sion (10-51) will break down at one of the boundary points x = 0 or x=\ 
and it will be necessary to obtain a different (inner) expansion in this region. 105 
This expansion must then be matched smoothly onto the outer expansion 
(10-51) in some intermediate region. It is necessary to investigate the be- 
havior of the asymptotic solutions at both boundaries in order to find which 
one will correspond to the boundary layer region. If this is done, it will be 
found that, due to the condition (10-50), it will be impossible to match any 
inner solution which occurs near the boundary x = 1 to the solutions (10-57) 
and (10-58). This is due to the fact that condition (10-50) will cause all possible 
solutions for the region near x= 1 to grow exponentially with the distance 1 — x 
and this type of behavior is incompatible with the solutions (10-57) and (10-58). 

The constants C„ for n = 0, 1, 2, . . . must, therefore, be determined 
so that the boundary condition (10-55) and the second boundary condition 
(10-56) at x= 1 are satisfied. Hence, Co= a and C„ = 0 for n= 1, 2, . . .; and 
the solutions (10-57) and (10-58) become, respectively, 

yo ( x ) = ( 1 0-60 ) 

y n (x)=-e -«<*> f^ U) ^^x^dx for n= 1,2, . . . (10-61) 


The functions y n (x ) for n= 1, 2, . . . can now be determined successively by 
substituting the expression for y„_i obtained in the previous step into the right 
side of equation (10-61). Thus, substituting equation (10-60) into equation 
(10-61) with n = 1 gives 

105 Which is frequently referred to as the boundary layer region. 
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yi(x) = — ae _n(jp) 



1 

p(x) 


[ r 2 (x ) — r' (x)~\dx 


(10-62) 


where we have put 


r{x) = 


q(x) 

p{x) 


(10-63) 


and we shall suppose that the integral 


(10-64) 


exists. 

Unlike the situation which occurred at the transition point, the zeroth- 
order solution remains bounded at x=0. In fact, its limiting value is ae _n(0) . 
But in order that the boundary condition (10-47) be satisfied, it is necessary 
that the asymptotic solution change from the value 1 to the value ae _n(0) across 
the boundary layer region. Now we anticipate that the expansion (10-51) will 
hold over most of the region 0 «£ x 1 but that it will break down in a narrow 
region near *=0 whose thickness approaches zero as e— » 0. Since the asymp- 
totic solution y must change from to 1 across this very narrow region, 

the derivatives y' and y" must become very steep (large) in this region. How- 
ever, in assuming that the solution had an asymptotic expansion of the form 
(10-51), we were essentially treating the terms y", py ' , and qy in equation 
(10-46) as if they were of order 1. In order to overcome this difficulty, we pro- 
ceed just as in the case of the turning point and rescale equation (10-46) in a 
manner which is appropriate to this boundary region by introducing a new 
“stretched” independent variable x by 


x 


x 

*(«) 


(10-65) 


where lim </>(e) — 0 and the function d> is to be chosen so that it is of the scale 

€ 0 

of the boundary layer region. In this case, however, it is necessary to determine 
the scaling function (f> in a different manner than in the case of the turning point. 
Thus, upon substituting equation (10-65) into equations (10-46) and (10-49), 
we obtain 
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0(«) d 0 " 66 ) 

where we have put y (Jc; c) = y and, in view of equations (10-49) and (10-50), 
p(<f>(e)x)= p 0 + Pi<l>(€)x + p 2 <t> 2 (e)x 2 + . . 

for po # 0 

g(0(e)^)= g 0 + 9x<^(€)^ + g 2 <^) 2 (e)^ 2 + • • . 

(10-67) 


Now in order that the inner solution satisfy the boundary condition at 
x = 0 and still contain another constant which can be adjusted to match the 
outer solution, we must require that the equation which is satisfied by the 
lowest order term in the expansion of the solution in the inner region (the inner 
expansion) be of second order. This equation is obtained by holding x and y 
fixed and taking the limit 106 e—^0 in equation (10-66). However, the limiting 
form of this equation depends on the choice of the function <f>(e). And in order 
that the highest order derivative be retained in this equation we must require 
that 


<f>=0(e) ( 10 - 68 ) 

If the condition <f>=o(e ) also held, the second and third terms of the 
limiting equation (10-66) would drop out and we would be left with the equation 


_e_ <P± = 0 

<£ 2 (e) dx 2 


(10-69) 


However, if we require that 


<f>*o(e) (10-70) 

then both the first and second terms will be retained in the limiting form of 
equation (10-66). In addition, the size of the inner region (which is determined 
by 0) will be larger if condition (10-70) holds than if it did not hold. Thus, the 


106 This limiting process is called the inner limit. The limit taken while holding x and y fixed and letting e— > 0 is 
called the outer limit. 
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solution which would be obtained by assuming that condition (10-70) did not 
hold is a limiting case (for small x ) of the inner solution which is obtained when 
equation (10-70) does hold. We shall, therefore, assume that conditions (10-68) 
and (10-70) hold. And since only the order of magnitude of <f> is important, we 
will lose no generality by putting <f>(e) = e. Hence, equation (10-66) becomes, 
upon inserting equation (10-67), 

+ (p 0 + epix + e 2 p 2 x 2 + . . . ) ^j| + (eq 0 + e 2 qix + . . .)y=0 (10-71) 


We now suppose that the solution to equation (10—71) has an asymptotic 
expansion of the form 107 


y(x; e) ~ ]£ y n {x)e n 


w = 0 


(10-72) 


Then upon substituting this into equation (10-71) and equating to zero the 
coefficients of like powers of e, we obtain 


d 2 yo 

dx 2 


+ Po^-° 


(10-73) 


d*?n 
dx 2 




forn=l,2, . . . 


(10-74) 


where we have put 


H n (x)=~ 



XPn-k~^~ ykQn-k-l^ X n k 1 


for n= 1,2,. . . 

(10-75) 


And upon substituting equation (10-72) into the boundary condition (10-47), 
we find that 


yo(0) = l 


(10-76) 


107 It should not be concluded from the results obtained so far that the asymptotic expansions of the solutions to 
equations containing a small parameter will always be power series in the parameter. For example, logarithmic terms 
could occur. 
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y„(0)=0 for n= 1,2, . . . (10-77) 

Since equations (10-73) and (10-74) do not contain the dependent variable 
explicitly, they can easily be integrated by the methods of chapter 4 to obtain 


yo=c 0 +b 0 e po * 

y n = c n + b n e~ P °*+ — f [1 -e Mf ~ S) ]H n (y)dy forn=l,2, . . . 
Po J o 


which become, after applying the boundary conditions (10-76) and (10-77), 


yo=c„+(l-c,)e- w (10-78) 

y„=5.(l-e"")+-f for n= 1,2, . . . 

P 0 Jo 

(10-79) 

We can again determine the functions y n for n=l, 2, . . . successively by 
substituting the expressions for y n - 1 obtained in the previous step into the right 
side of equation (10-79). Thus, upon substituting equation (10-78) into the 
right side of equation (10—79) with n = 1 and carrying out the integration, we 
obtain 


yi = ci (1 


e -P°x) -f 


(1 - Co) _ 
xe , s 

Po 


I PlPo _\ _ Qo _ 

( 9o- Pi 2 ~x\-Co^- 0 x 


(10-80) 


where we have put 


ci = Ci + — [(1 - Co) (pi - qo) + co^o] 

Po 
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As in the case of the transition point, we now suppose that there is an 
overlap domain in which the outer expansion (10-51) and the inner expansion 
(10-72) both apply and that within this overlap domain, these two expansions 
agree to within an error which is of smaller order than the last term retained. 
In order to perform this matching, we introduce an intermediate variable 


x _ ex 
p(e) v(e) 


(10-81) 


into both the inner and outer expansions. The scale factor v(e) is determined 
so that the variable x + is of order 1 in the overlap domain. Since the size of 
the overlap domain must approach zero as e —* 0 and since it must be farther 
away from x = 0 than the inner regions whose size is 0(e ) , we must require that 

e = o(v) (10-82) 


and 


i>=o(l) 


(10-83) 


To proceed with the matching, we then reexpand the first m terms for m = 1,2, 
. . . of both the inner and outer expansions while holding x + fixed and then 
require that the difference between these two expansions be of o{e m ~ l ). That 
is, the inner and outer expansions are asymptotically equal in the overlap 
domain to the appropriate order. Hence, we require that 


yoOU)*-'-] + 


«yi|>(e)* + ] + . . . + e n y n [p(e)x + ] — y 0 



— eyi 




as e — > 0, 


for n = 0, 1, 2, . . . with x + fixed (10-84) 
Thus, for example, when n = 0, this becomes 

yo|V(e)* + ] — yo ^ * + 1 =o(l) as e — » 0 with x + fixed (10-85) 
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And when n = 1 , it becomes 

yo |>(€)* + ] + eyi |>(e)* + ] “ — efi J = o(e) 

as € —* 0 with x + fixed (10-86) 


Now since 


wrw**]- r tgJ B^= r*£u 

Ji p(™ + ) J 1 ptf) h p(i) 


+ 


lo* + %W d * = n(0) + V(e)X+ ^ + 0{v2) 

M 

it follows, upon expanding the exponential, that 


e -n[K«)x + ] = e -ft( 0 ) e -px+( 9 o/po)+ 0 (v 2 ) = e -ft(o) j — v(e) — x + +0 (v 2 ) 

L Po 

And, the solutions (10-60) and (10-62) therefore become, respectively, 


y 0 [i'(€)* + ] = ae -ft(0) £l — v(e) x + +0 (v 2 ) j (10-87) 

y![v(e)x + ] = ae-°l 0 )\A+0(v)] ( 10 - 88 ) 

where definition (10-64) has been used. On the other hand, the solution (10-78) 
becomes 


yo 





= c 0 + (1 — co)e~ Po[, ' ( * )/<]x+ 


But, since equation (10-82) shows that 
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Equations (10-87) and (10-89) now show that 

yo[i'(€):*: + ] — jo j == ae _ft(0) — c 0 +0 (v) 

since the exponentially small terms are of lower order than any power of v. But 
since lim v(e) = 0, this shows that the zeroth-order matching condition (10-85) 
will be satisfied provided that 

c 0 = ae -ft(0) (10-91) 

Similarly, equations (10-86) to (10-90) show upon substituting in equation 
(10-91) that 

yo[v(€)x + ] + €yi[t'(e)x + ] — jo ^^-x + — eji 

= ae -n(0) ^1 — v^x + + eA^ — ae _n(0) ^1 — v ^ x + ^j — eci+ 0(v 2 ) 
= e[e~^ 0) aA — Pi] + 0(v 2 ) 
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Hence, the first-order matching condition (10-86) will now be satisfied if we put 
Ci = ae~ n(0) A and choose the scaling function v so that 


v 2 = o (e) (10-92) 

Notice that v must simultaneously satisfy conditions (10—82), (10—83), and 
(10-92). There are many choices of v which will accomplish this. For example, 
we can take v(e) = e 2/3 . 

We have now obtained the following results: In the outer region, the expan- 
sion to two terms is 


y(x; e) ~ ae~ ft(x) jl + € ^^[r 2 (x) —r' (x)]dx+ . . ,| (10-93) 

In the inner region, the expansion to two terms is 
y(j c; e) ~ e -p °* j[l — ae~ ft(0) ] ^1 (qo~ Pi — ^ j — ae -ft(0) e4 j 

+ ae- n <°^l + e4-^e^+ . . . (10-94) 

And in the overlap domain, both of these expansions take on the common 
value y t given by 

y ( ~ ae- n(0) ^1 + (10-95) 

As in the case of the transition point, we can obtain an expansion which 
is valid everywhere in the interval 0 *£ x *£ 1 (that is, a uniformly valid expan- 
sion) by adding the inner expansion to the outer expansion and, so that it will 
not be counted twice, subtracting their common value in the overlap domain. 
Thus, 


y uniformly valid 1 € J p(x) P (x)]<ir|^ 


-t- e -pox 


j[l-ae- ft «»] 1+ff (qo-pi-^tj -ae~^ eA } + • • • (10-96) 
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Of course, we have not actually proved that equation (10-96) is indeed an 
asymptotic expansion of the solution to equation (10—46) subject to the boundary 
conditions (10-47) and (10-48). However, it is common practice in applied 
mathematics to accept an expansion obtained by a formal procedure (such as 
those given in this section) as being a true asymptotic expansion of the solu- 
tion to the problem. Indeed, most problems which are treated in practice are 
too complicated for rigorous proofs to be carried through. 

We emphasize again that this method applies to nonlinear equations just 
as well as to linear equations. 

10.4 METHOD OF STRAINED COORDINATES 

We shall now consider some additional techniques which can be used to 
handle the nonuniformities that arise when asymptotic solutions are sought 
to certain types of differential equations. Thus, consider the differential 
equation 


^f+o>*y=ey 3 (10-97) 

which describes the oscillation of a mass on a spring with a weakly nonlinear 
restoring force. And for definiteness, let us impose the initial conditions 

y(0) = 1 * (0> = 0 (10-98) 

If we attempt to find an asymptotic solution in the form of a straight- 
forward power series in e 

y(t; e) ~ y 0 (t) + eyi(t) + . . . 

then we get, upon substituting this into equations (10-97) and (10-98) and 
equating to zero the coefficients of like powers of e, 



+ o) 2 yo — 0 


(10-99) 
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d 2 y{ 

+ (o 2 yi = y§ (10-100) 

yo(0) = 1 ^ (0) = 0 (10-101) 

yi (°) = ^ (0) = 0 (10-102) 


However, the solution to equation (10-99) subject to the initial conditions 
(10—101) is yo = cos o)t. And after substituting this into equation (10—100), we 
obtain 


d 2 yi 2 - , , 3 .1 0 

— TT + (0 2 yi — cos 3 (Jit = 7 cos Oit + — cos 3 Oit 
at 2 4 4 

But since the general solution to this equation is 


y i = - 


32 o> 2 


_ , 3 t . 

cos ooit + g — sin (xit + Ci cos (ot + C 2 sin oit 


a) 


the asymptotic solution to equation (10-97) is 


y ~ cos at + e 


f3 t . 
a -si 

|_o (Ji 


sin oit 


32ft) 2 


cos 3<wf + ci cos oit + C 2 sin ait 


+ 


However, the first-order term in this expansion will not be small compared 
with the zeroth-order term when i = 1/e due to the presence of the term 
t sin ait in the solution. Such terms are called secular terms. The expansion 
will, therefore, not be valid for large times even though it may be suitable for 
calculating the solution at small times. The solution of equation (10-97) is 
periodic. However, due to the appearance of secular terms, the solution cannot 
be carried to sufficiently long times to calculate the distortion of the period 
due to the nonlinear restoring force. 

A method (based on the work of Poincare 108 ) for alleviating this difficulty 


108 The ideas were developed in the course of his work on periodic orbits of the planets. 
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was developed by Whitham and by Lighthill (ref. 37). The method depends 
upon introducing a new variable r and then considering both the dependent 
variable y and the independent variable t to be functions of r. Thus, 

t= t{r; e) 

y=y(r; e) 

And we suppose that these functions can be expanded in powers of e to obtain 

t = T + eti(r) + . . . (10-103) 

y = yo{r) + eyiir) + . . . (10-104) 


The arbitrariness introduced into the solution by the functions ti, etc., is used 
to adjust the nonhomogeneous terms in the equations for yi, j 2 , • • • so that 
the secular terms will not occur in the expansion. The resulting distortion of 
the time scale wiD cause the frequency of the motion to depend on €. And this 
will allow us to calculate the variation of the period. 

Now it follows from equation (10-103) that 


d _ dr d 1 f /_j_ \ “ 

dt dt di (1 + et[+. ..) dr 1 dr 


(10-105) 


where the prime denotes differentiation with respect to t. Hence, . 


dr 2 


• ■)£ [(!-«;-■ 





= (1—2 et[ + . . .) 


(Pyo cPyi _ ,, dyo 

dr 2 6 dr 2 € 1 dr 


After substituting this into equation (10-97) together with the expansion 
(10-104) and equating to zero the coefficients of like powers of e, we obtain 


<fiy o 
dr 2 


o) 2 yo = 0 


(10-106) 
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(Pyi 

dr 2 


+ <a 2 yi-y% + 2t[ ^jr- 


,, dyo 
1 dr 


(10-107) 


In order to find the proper boundary conditions, we need expressions for 
the values of y and dy/dt at t = 0 in terms of their values at t = 0. Thus, we use 
a Taylor series expansion of these quantities about r = 0 to obtain 


y(t= 0) = y|„.+ ® t(0) + . . . 

\ar/T=o 


dy_ 

dt 


M 



t( 0) + . . . 

7=0 


where r(0) denotes the value of the function r(t) (obtained by solving equation 
(10-103) for r as a function of t) at the point t = 0. Upon substituting in equations 
(10-103) to (10-105), we find 


dy 

dt 


y(t=0) = y.(0)+«[ w (0) + t,(0) (^) 1 + . . . 

„.=(£)„-[(£ L-w (f L^<o, (f?)j 


+ 


Then substituting these expressions into the boundary conditions (10-98) and 
equating to zero the coefficients of like powers of e shows that 


yo(0) = l 


^ ( 0 ) = 0 

dr 


(10-108) 


y.(0)+«,(0)^(0)=o 
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dyi 

dr 


( 0 ) -<|( 0 ) ( 0 ) + 1 ,( 0 ) ( 0 ) = 0 


or, using the boundary condition (10-108) to simplify the second group of 
boundary conditions, 


yi(0) = 0 


dyi 

d t 


(0)+ti(0) 


d 2 yo 

dr 2 


(0) = 0 f 


(10-109) 


The solution to equation (10-106) subject to the boundary conditions 
(10-108) is 


yo=coscor (10-110) 

This has the same form as the zeroth-order solution obtained by the regular 
perturbation procedure. However, upon substituting equation (10—110) into 
equation (10-107), the latter equation becomes 


d 2 y i 


dr 2 


^ + o> 2 yi = cos 3 (oT — 2(o 2 t[ cos cor — cot" sin o>r 


1 /3 

= — cos 3 <ut + ( — — 2o) 2 t[ 


cos (oi (i)t\ sin cor 


We now determine the function ti(r) so that secular terms will not appear in 
the first-order solution yi. These terms arise because the solutions cos cor 
and sin cor of the homogeneous equation appear in the nonhomogeneous terms. 
Thus, the secular terms will not occur if we choose h so that the coefficients of 
these two terms vanish. Hence, we take t"= 0, t' = 3/8<o 2 , or 


1 1 = 


3 

8co 2T 


( 10 - 111 ) 


where we have set the constant of integration equal to zero in order to simplify 
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the second boundary condition (10-109). And the differential equation for yi 
becomes 


d 2 y i 
dr 2 


+ a) 2 yt =7 cos 3o>r 


Now since substituting equation (10-111) into the expansion (10-103) 
shows that 


t = T + 


8<u 2 


€T + 


we find upon neglecting higher order terms in e that 



And when this is substituted into the zeroth-order solution ( 10 — 110 ), we obtain 


yo = cos 



t 


Thus, we find that the frequency is modified by the nonlinear restoring 
force even in the lowest order term of this expansion. In contrast to this, the 
direct expansion leads to no information about the frequency change. The 
expansion can be carried to higher orders by continuing to use the functions 
t n forn = 2 , 3, . . . to eliminate the secular terms. And this expansion will be 
uniformly valid for all times since no secular terms will appear. 


10.5 MULTIPLE-TIME METHODS 

Equations which contain two disparate time or length scales are frequently 
encountered in practice. This happens, for example, in problems which involve 
a small force acting over a long period of time. An example of this is a spring- 
mass system subjected to a weak viscous damping. The two times which occur 
in this problem are the basic period of oscillation and the damping time which 
occurs over many periods. The basic idea of the method is due to Kuzmark. 
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And a detailed discussion of this method is given by Kevorkian in reference 38. 
The presentation and examples used herein are taken from this report. The 
method takes into account the long-time effects in such a way as to render 
the expansion uniformly valid. As in the preceding example, it involves reason- 
ing about the first-order terms to determine the zeroth-order solution. 

We set up the expansion in such a way that a fast time variable t and a 
slow time variable t are explicitly exhibited. The slow variable is assumed to 
be related to the fast variable by 


i=<f>(€)t (10-112) 

where lim <f>(e)= 0. And we suppose that the solution to the differential 
€—►0 

equation has an expansion of the form 

y(t;e)~vo(e)Mt*,i) + v l (e)Mt*J) + M€)f 2 (t*J) + . . . (10-113) 


where {^(c)} is an asymptotic sequence as e — > 0 and t* is another fast time 
variable related to the fast variable t in such a way that it accounts for the 
change in frequency that occurs in the problem. This relation is taken in the 
form 


t* = t[l + fxi(e)(0i + fi2(e)(t>2+ . . .] (10-114) 

where {/a.j(c)} is another asymptotic sequence and {ooj} is a sequence of 
constants which are determined by the problem in such a way as to render 
the expansion (10-113) uniformly valid. 

The variables t* and Tin equation (10-113) will be treated as independent. 
Hence, the ordinary differential equation satisfied by y will be transformed 
into a sequence of partial differential equations. However, it turns out that 
they can still be treated as ordinary differential equations. 

Upon differentiating equation (10-113) with respect to t, we find that 

dy_ / dy\ dt* | /ay\ dt 
dt \at*/r dt \at A* dt 

But since equations (10-112) and (10-114) show that 
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dt* 

= 1 + fJ-i («) <oi + fii (e) o> 2 + . 



this becomes 

[l + ^i(e)a>x+ . . •] [*'«( € )f^ +I, 1 ( € )f^ + • * •] 

+ <f>(e) |^o(€)^ + ^i(€)-j-r+ . . .1 

L dt dt J 


or 


This equation shows that changes which occur on the slow time scale t are 
small compared with the changes which occur on the fast time scale. 

The method is best illustrated by considering a particular example. Thus, 
consider the equation 


cPy 

dt 2 


f y+ e 



(10-115) 


which governs the behavior of an oscillator with weak cubic damping. We 
shall seek an asymptotic solution which is uniformly valid for 0 t ^ oo ? subject 
to the boundary conditions 


y (0) = 0 (10-116) 

^(0) = 1 (10-117) 

In this case, it is reasonable to begin by choosing the functions <f> and the two 
asymptotic sequences {vj} and {fXj} to be <j>(e)=€ and Vj — (ij = e J . Then 
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t= et and the expansion (10-113) becomes 

y (t; e) =/o (t*,t)+efi (t*,t) + . . . 

with 

t* = t (l + 0) 2 € 2 + . . .) 

where we have omitted the term ewit since we wish to ensure that et only 
occurs in the solution 109 as t. Hence, 


dy 

ay o 

= h € 



dt 

dt* 

\dt 

dt * / 

cPy 

a 2 / 0 

— — l-2e 

a 2 / 0 

4- e * V ' + 0(e 

dt 2 

dt* 2 

dt*dt 

dt* 2 


Upon substituting these results into equation (10-115) and equating to zero 
the coefficients of like powers of e, we find that 


d 2 /o 

dt* 2 


fo=0 


(10-118) 


d 2 /i g d 2 fo 
dt* 2 dt*dt 



l 3 +/,=o 


(10-119) 


And since y(0) =/o(0, 0) + e/i (0,0)+ . . . and 


109 There is a certain amount of arbitrariness in the choice of the variables t* and t. 
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Now the solution to equation (10-118) is 

fo ( t *, t) = Co(t) sin D 0 (£) cos t* (10-122) 

where the functions Co(t) and Do(t) are arbitrary functions arising from the 
integration. But substituting equation (10-122) into the boundary conditions 
(10-120) shows that 

Z) o (0) = 0 C o (0) = l (10-123) 

As in the preceding method, we now determine the functions Co and Do 
so that the first-order solution f\ will not contain any secular terms. To this end, 
notice that equation (10-122) implies that 

(d£) 3= (Co C0S l *~ Do sin **) 3=C o cos 3 t* — SDoCl sin t* cos 2 t* 

+ WICo sin 2 t* cos t*-D% sin 3 t* 

Hence, we find, upon using the identities 
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cos 3 t * = 7 cos 3 1* + 7 cos t * 

4 4 

sin 3 t* = — ^ sin 3 1* + ^ sin t* 

sin t* cos 2 t* = sin t* — sin 3 t* 
sin 2 t* cos t*= cos t* — cos 3 t* 

that 

(ff^) 3== 4 Co(C% + D$) cosf*-|flo(C§ + D 2 ) sin t* 

+ ^D 0 (D 2 -3C 2 0 ) sin3f*+|c 0 (C 2 -3Z> 2 ) cos 3t* (10-124) 

We therefore find, after substituting equations (10-122) and (10-124) 
into equation (10-119), that 

|^+/, = [|o„(C5 + C§)+2^!] s in (*-[|Co(DS + CS)+2-^-°]cos(* 

- jOo(OS-3Cg) sin 3l*-|c,(q-3D5) cos 3l* (10-125) 

In order to ensure that secular terms do not occur in /i, we must eliminate 
sin t* and cos t* from the nonhomogeneous term of this equation. This can be 
accomplished by putting 


2^+|o„(OS+CS)=o] 
dt 4 j 

2^+-C„{D%+C%) = o\ 
at 4 9 


(10-126) 
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Upon multiplying the first of these by Do and the second by Co and adding the 
results, we get 


-4(DJ+CJ)+f (DJ+CJ)«- 0 

at 4 


which has the solution 


D$+Q= — 


Z t+K ' 


where Ki is a constant. And substituting this into the boundary condition 
(10-123) shows that we must take K\ = 1 to obtain 


Dl+Cl= 


1 


|t + l 
4 


(10-127) 


But substituting this into the first equation (10-126) shows that 


o dDo , 3 Dp 

Since the solution to this equation subject to the first boundary condition 
(10-123) is D 0 (t) = 0, equation (10-127) becomes 


C 0 



where the positive square root is taken to ensure that Co satisfy the second 
boundary condition (10-123). Substituting these results into the zeroth-order 
solution (10-122) now shows that 
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hit*, t) 


sin t* 

V!' +i 


Hence, we obtain the one-term uniformly valid asymptotic expansion of 
the solution to equation (10-115) 

sin t* _ sin t 

V!* +i V i+ ! ef 

The procedure can be continued to obtain higher order terms, the solution to 
any given order being determined by reasoning about the next higher order 
terms. 


i 
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Numerical Methods 


It frequently happens that the differential equations encountered in prac- 
tice cannot be solved by the exact and approximate methods discussed in the 
preceding chapters. In such cases it is often necessary to resort to numerical 
methods in conjunction with a digital computer. These methods usually involve 
replacing the differential equations by a number of algebraic equations, called 
difference equations, in such a way that the solution of the difference equa- 
tions is in some sense close to that of the differential equation. There are a 
large number of numerical procedures available. The choice of method is 
influenced by the type of auxiliary conditions as well as by the form of the equa- 
tion. Thus, when all the auxiliary conditions are imposed at a single point 
(initial conditions), the solution can be developed by means of “marching 
techniques,” which solve the difference equations in succession. These 
marching solutions can be carried out either by using implicit methods such as 
Euler’s method and the Runge-Kutta method or by using explicit methods 
such as the Adams method and the improved Euler method. Each of these 
methods has advantages and disadvantages which will be discussed sub- 
sequently. When auxiliary conditions are imposed at two points (boundary 
conditions), it is usually possible to solve linear equations directly by using 
either the superposition principle or finite difference schemes which involve 
matrix methods. However, when the equations are nonlinear, it is often neces- 
sary either to linearize the problem or to reduce it to a set of initial-value 
problems and then use iterative or matrix techniques to obtain the solution. 

The subject of numerical solutions to differential equations is quite vast 
and we cannot hope to cover it completely in a single chapter. For more 
detailed information, the reader is referred to references 39 to 42. 
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11.1 APPROXIMATION BY DIFFERENCE EQUATIONS: ERRORS AND INSTABILITY 

Consider the general system of nth-order differential equations 

F (y (n) , y (n_1) , . . . , y, x) = 0 (11-1) 

defined on the interval a ^ x < b and subject to appropriate initial or boundary 
conditions. 

In order to obtain a numerical solution to this system we first partition 
the interval a ^ x ^ b. A partition of the interval a ^ x ^ b is defined to be 
any finite set of points x\, xi, . . ., x m +i which has the property that a=x i < 
X 2 < . • • <x m +i = b. The length hj of the yth subinterval, 110 xj x *j+i, is 
called the step size. Thus, 


hj = Xj+ i~Xj for y = 1, 2, . . ., m 

The system of differential equations (11-1) is then “replaced” by a set 
of algebraic equations, say 

Gfc(y t , . . . , y m +i j Xi , • • • , Xm+ 1 ) 0 for k 1,2, . . . , p (11 2) 

called difference equations. Their solution is the set of m + 1 vectors yj , . . . , 
ym+i, which are approximately equal to the values taken on by the solution 
y=f(jc) of the system (11 — 1) at the m + 1 points Xi, . . .,x m +i. Before discussing 
the various methods whereby such difference equations can be constructed, 
we shall first consider certain types of errors which can occur when difference 
equations are used to obtain numerical solutions. 

The discretization or truncation error Ei at the ith step is defined to be 
the magnitude of the difference between the solution to the differential equa- 
tions at the point Xi and the solution yi of the difference equations. Thus 

Ei = | f (xi) yi | 

This error depends only on the type of difference equation used and is inde- 
pendent of the method by which it is solved. 


1,0 It is not necessary to have all hj equal, but it is usually desirable. 
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However, there is also an error which is caused by the numerical procedure 
itself. Because a computer can accommodate only a limited number of sig- 
nificant figures, it cannot store accurately an irrational number or even a 
rational number requiring precision beyond the computer’s capability. There- 
fore, the computers themselves introduce an error which results from the 
necessity of rounding off numbers: it is known as the round-off error. The 
round-off error at any step in the computations propagates to the next step and 
is combined there with the round-off error of that step. The generation of the 
round-off error at each step is extremely unpredictable. Precisely for this 
reason, analyses of round-off error often treat the error per step as a random 
variable (see ref. 43). 

Closely related to the question of error is the question of stability, which 
must be considered in certain instances before a numerical solution can be 
obtained. Various types of instability can arise. If the instability is inherent 
in the differential equation itself, it is called an inherent instability. For ex- 
ample, consider the initial- value problem 


(11-3) 


subject to the initial conditions yi (0) = 1 and y 2 (0) = — 10. The general solution 
for equations (11 -3) 


dy i 


d ± = 100 *' 


yi(x) = C 1 e -10x + C 2 e 10x 
y 2 (x) = - 10Cie -10x + 10C 2 e 10x 

And the initial conditions are satisfied by taking Ci = 1 and C 2 = 0. However, 
when a numerical procedure is used, it will usually be impossible to satisfy 
these initial conditions exactly. But a small error in determining the constant 
C 2 will then allow the second terms in the solutions to eventually grow so large 
that they will dominate the first terms, which correspond to the solution being 
sought. Situations of this type arise most frequently when the initial-value 
problem is being solved as part of an iteration procedure to solve a boundary- 
value problem. 
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Instabilities can also arise from the difference equations (even when the 
differential equations are stable). They are then called induced instabilities. 
These instabilities can result in spurious solutions to the difference equations 
which do not correspond to solutions of the differential equations. For more 
details the reader is referred to reference 43. 

11.2 INITIAL-VALUE PROBLEMS 

In this section we shall consider certain types of “difference schemes” 
which are suitable for obtaining numerical solutions to initial-value problems. 

11 . 2.1 Explicit Methods 

11.2.1.1 One-step processes: Taylor series method. — It has been indicated 
in chapter 3 that any normal system of ordinary differential equations can 
always be written as a first-order normal system, which in vector notation has 
the form 


f=G( X ,y) (11-4) 

We shall now consider some methods, referred to as one-step processes, 
for obtaining numerical solutions to this equation on an interval a ^ x^b 
subject to the initial conditions y=y x at x = a. For any given partition of 
a «£ x *£ b, say*i, . . ., x m +i, we can seek to approximate the solution y=f(*) 
of the system (11-4), subject to these initial conditions, by replacing this 
equation by the set of algebraic equations (or difference equations) 

yj+i = yj+G(*j, yj)hj for j=l, 2, . . .,m (11-5) 

It can be seen that, starting with the value yi, equation (11—5) can be used to 
calculate y/ successively at the points j—2, 3, . . ., m+1. We hope that 
the vectors y j will provide good approximations to the values f(*j) of the solu- 
tion to equation (11-4) at the points Xj. 

The discretization error which occurs when using a difference equation 
to integrate a differential equation across a single step, T(xj, hj), is called the 
local truncation error or the truncation error per step. Thus, the truncation 
error per step is the error induced by using equation (11-5) to calculate 
f (xj+hj) approximately from the value f(*j) = yj or, symbolically, 
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T(xj , hj ) = | yj+i-f (xj+hj) \ 

Hence, upon substituting in equation (11-5) with yj = f{xj), we get 
T(xj, hj) = | f(xj) - f (xj+hj) + G(xj, f(xj))hj | 

But Taylor’s theorem shows that 111 

f(xj + hj) = f(xj) + (£) je hj+0(hp 

Hence, 

T(x i ,h,)= ftj + 0(A?) 

But since, by hypothesis, f(Ac) satisfies the differential equation (11-4), the 
first term vanishes and we obtain 

T(xj , hj ) = 0(hj ) ashj—*0 (11~6) 

More generally, we can attempt to approximate the solution to equation 
(11-4) by means of a difference equation of the form 

yj+i = yj + ®(xj, yj; hj)hj forj= 1, 2, . . . , m (11-7) 

where the function is to be chosen so that in some sense the solutions to 
equation (11-7) provide good approximations to the solutions of equation 
(11-4) for sufficiently small step size hj. In order that this be the case, we 
certainly must require that 

OO*:, y; h) -*■ G(*, y) ash^O (11~8) 

If we again let y=f(jt) be a solution to equation (11—4), the truncation 
error per step T(xj, hj) is 


111 The concept of order and the symbol O are introduced in chapter 9. 
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T(xj, hj) = | {(xj ) — f (xj + hj ) + $>(xj , f (xj ) ; hj)hj | 

The same argument as was used in obtaining equation (11-6) used to- 
gether with condition (11—8) now implies that 

T(xj ; hj) — o ( hj ) as hj-+0 ( 11-9) 

This shows that the truncation error per step goes to zero faster than the 
mesh size hj. Other things being equal, it is of course desirable to have the 
truncation error per step approach zero at the fastest possible rate as hj — > 0. 
In order to have some measure of this rate we define the order of a given 
difference scheme to be the largest number p such that 

T(xj ; hj ) =0(hf +1 ) as hj — > 0 

Equation (11-6) shows that the order of the difference scheme (11-5) is 1. 
And, more generally, equation (11—9) shows that the order of any difference 
scheme of the type (11-7) which satisfies condition (11-8) is greater than 
zero. The numerical method corresponding to the difference scheme (11-5) 
is called Euler’s method. Although this method is very simple, it is prone to 
round-off errors and is therefore infrequently used. 

The truncation error per step can be used to obtain a bound on the (cumu- 
lative) truncation error for difference equations such as equation (11-7), that 
is, difference equations which determine the solution at the point Xj + 1 only 
in terms of quantities from the preceding step. Instead of considering the 
general system (11-4), we consider, for simplicity, only the single differential 
equation 


fx = G(x ’ y) 


( 11 - 10 ) 


Suppose G(x , y ) satisfies the Lipschitz condition with respect to y 
\G(x, y) - G(x, £) | M\y - £| 
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and let 


T(xj, hj) f o 

A = max tor; = 1, 2, . . m 

hj 

Then it is shown in reference 4 (p. 181) that the truncation error Ej is, at most, 

A ( e |x r x,|iw_ !) 


that is, 


- 1 ) 

Now for any difference scheme of order p, there exist constants Dj inde- 
pendent of the mesh size hj such that 


r(*j, hj) 

hj 


Djhf 


And if we put 


e = max hj 
D = max Dj 


for j=l,2 . . m 


then A =S De p and the truncation error satisfies the inequality 
DfP 

Ej =£ — 1) for 7=1,2,. . ., m 

which shows that the (cumulative) truncation error is of order e p when the 
difference scheme is of order p. 
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In principle, it is easy to derive formulas for numerical integration of 
the system of equations (11“4) which are of an arbitrarily high order. This 
can be accomplished, for example, by the method of Taylor's series. Instead of 
applying this method to the general system of first-order equations (11-4), 
we shall again, for simplicity, consider only the single first-order equation 
(11-10). There is no difficulty in extending the ideas to the general system 
( 11 ^). 

Let y—f(x) be a solution to equation (11-10) and let g(x,y) be any r-times 
continuously differentiable function of x and y. Taking the total derivative of 
g with respect to x along the curve which is obtained by plotting y = f(x) gives 

^dg^dg dgdy = dg dg^ 

g* dx dx dy dx dx dy 


Applying this formula to the function gi(x, y) gives 


<Pg dgi _ dgt r dgi _ / d c d \ (d_ 
dx 2 dx dx dy \d:r dy) \dx 


+cf-u-(^+G^y« 


dy 


dx 


dyj 


and, in general, we obtain 


dfg = dgn-l _ dgn-1 . r dgn- 1 _ ( d_ r d_\ n 

dx n dx dx dy y)* dy) 


for n = 1 , 2 , . . . , r 


Thus, in the special case when g(x, y) =G(x, y), this equation becomes, in 
view of the differential equation (11-10), 


d 2 y 

dx 2 


= G x + GGy 




— G X x~^~ 2GG X y~^ G 2 G yy + G x G y + GG 2 ( 11 — 12 ) 

dx 3 
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cf n+1 y 

dx n+1 


( 


i-+c± 

dx dy 


) 


G 


(11-13) 


d r+1 y 

dx r+1 


(t +G h) 


Now suppose that G is p — 1 times continuously differentiable and put 

| <^U +C a? ) c di-14) 


Then, at least in principle, <I> can be calculated for any integer p simply by 
differentiating the given function G. Hence, when y=f(x) is a solution to equa- 
tion (11-10), it follows from equation (11-13) that 

*(*,/(*); A) = (11-15, 


Thus, the truncation error per step, which is incurred when the difference 
equation 


yj+i = yj + < &(xj, yji hj)hj for 7 = 1 , 2 , . . .,m (11-16) 

is solved to obtain an approximation to the exact solution, y=f(x), of the 
differential equation (11-10), is 


T(xj, hj) = | f(xj) -f(xj + hj) + (xj,f(xj ) ; hj)hj | 


But inserting equation (11-15) shows that 


T(x j, hj) = 


f(*j ) —f(xj + hj) + 


X 


n=0 




(n+ 1) ! 
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Hence, upon applying Taylor’s theorem we find that 

T( Xj , hj) = 0(hf +1 ) 

which shows that the formula (11-16) with <1> determined by equation (11-14) 
is of order p. The method of Taylor’s series is efficient for linear systems or 
even for equations where G is a polynomial of low degree in x and y. However, 
as can be seen from equations (11-11) and (11-12), the method usually becomes 
extremely complex. In order to avoid this complication due to the successive 
differentiation and at the same time to preserve the increased accuracy which 
is afforded by using the Taylor’s series method, a technique introduced by 
Runge, Kutta, and Heun known as the Runge-Kutta method can be employed. 
In this case, the function O in equation (11-7) is taken to be of the form 

*(x h y j ;h j )= 2 a*k s (11-17) 

1 

where 

ki = G(*j, yj) (11-18) 


and 

k« = G (xj+p-shj, yj + hj j fors = 2,3, . . ., r (11-19) 

For any given integer r, the parameters a s , p s , and \ s , n are to be determined in 
such a way that the order p of equation (11-7) is as large as possible. 

For simplicity, we shall again restrict our attention to the single first-order 
equation 

^ = G(x,y ) a^x^b (11-20) 

In this case, equations (11-7) and (11-17) to (11-19) reduce to 

yj+i = yj+®(xj, yj; hj)hj for;=l,2, . . ., m (11-21) 
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<D = ^a s k s (11-22) 

5—1 

k 1 = G(x j , yj ) (11-23) 

kg G ftghj, yj+hj ^ \g — i f n kjij for s = 2, 3, . . r (11—24) 

And if y=f(x ) is a solution to equation (11-20), the expression for T(xj, hj ), 
the truncation error per step, reduces to 

T(xj, hj) = \f{xj) —f(xj + hj)+<t> (xj, f(xj) ; hj)hj\ (11-25) 


Applying Taylor’s theorem to the function T(xj, hj) gives 


T(xj, hj) = 


V 1 \ d m T(xj, h) 

io m! L dhm 


hT + OihP* 1 ) 

_ h = o 1 3 


But differentiating equation (11-25) m times gives 


[ d m T(xj,h)l _ / d m - 1 $\ / d m y\ I 

[ dh m J A=0 “ m UrV* r o W m /x=xJ 


Hence, the method will be of order p, provided that 


m 


/a m ~id) \ /d m y\ f= 0 

\ dhj 1 1 ) hj= o \dx m /x=Xj 1^0 


for m= 1,2, . 
for m = p + 1 


(11-26) 


If we substitute equation (11-13) and equations (11-22) to (11-24) into equation 
(11-26) and equate to zero the coefficients of all the independent partial 
derivatives of G, we will get a set of nonlinear equations for a s , p s , and X*,„. 
For any given value of r, there will be a largest value of p for which these equa- 
tions can be solved. For 1 ^ 4, this value of p turns out to be equal to r. 

There is a certain arbitrariness in the solutions of these equations for the 
constants a*, p s , and \ s ,n. Thus, for r— 2, one of these constants can be chosen 
arbitrarily; and for r=3 and r=4, two of the constants can be chosen arbitrar- 
ily. The computations are carried out for the general case in reference 41. 
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Here we shall merely illustrate the method by considering the case where 
r= 3. Then equations (11-22) to (11-24) reduce to 

<t> = a 1 k 1 + a 2 k2 + a 3 k3 (11-27) 


ki = G(xj,yj) 

k 2 = G{xj + yj + hj\ U i ki) 

k 3 = G(xj + (x 3 hj, yj + hjk 2 , i ki + hj \ 2 , 2 k 2 ) 


(11-28) 


Expanding ki, k 2 , and k 3 in a Taylor series about hj = 0 and retaining terms 
only up to hj give 


ki = G (xj, yj) 


\ 


k 2 — G ( Xj , yj) + hj (f^ 2 G x + Xi, i GG y ) x =xj 

V=Vj 


+ h] ii 2 G X x + [*2^1, iGGxy + "2 - ^i, ^Gyy'j ^ +0(/ij) 

y=vj 


k 3 — G {xj,yj)+hj (im 3 G x + k 2> iGG y + \ 2 , 2 GGy)x=3x 

y=vj 


+ hj |^-|-/a§Gxx+ fa (^ 2 , i + h 2 , 2 )GGxy+ ~2 (h 2 , i + h 2 ' 2 ) 2 G 2 G y y 

+ X 2 ,2 (M2^x+ hl'iGGy) Gylx=X, + 0(/l?) 

Jy=yj 


K 11-29) 
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Now it is clear that 


1 /d m Q\ 
ml \dhf) hj =o 

is the coefficient of ( hj) m in the Taylor series expansion of <l> about hj— 0. 
Hence, when equations (11-29) are substituted into equation (11-27), the 
functions 


1 /d m 4>\ 
ml \dhf) hr o 

for m = 0, 1 , 2 are simply the coefficients of h f, hj , and hj, respectively, in the 
resulting expression. Thus, 

<*> (xj, yy, 0) = («i + a 2 + as) G (xj, yj) (11-30) 

= Ct 2 ((A 2 G X -\- A-l, lGGy)x=Xj + &3 [^3 Gx~h ( A- 2 , 1 T" ^ 2 , 2 ) GGy] x =Xj (11—31) 
hj = 0 y=vj v=Vj 

1 /a 2 $\ /1 1 \ 

2 \Bhj )hj= 0 ~ az \ 2 ^3 G xx+fx 2 h 1 , 1 GG xy + 2 Xf.i &Gyyj x=x . 

V=Vj 

^ 2 fX 3 G XX+ 1^3 (X2, 1 + X2, 2) GG X y-\- 2 (X2, 1 + h2 > i) 2 G 2 Gyy 

+ A.2,2 (/^Gx+ ^1, lGGy) Gy\ (11-32) 

JX-Xj 

y=vj 



On the other hand, equations (11-26) show that 
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2 (— \ — (d-¥\ 

\dhjj hj=0 \dx 2 )x=Xj 

= (d*y\ 

6 \dh] ) hj=o \dx :i ) x =xj 


Upon substituting equations (11-11), (11-12), (11-20), and (11-30) to (11-32) 
into these relations, we find that the resulting equations are satisfied identically 
in Xj and yj only if the constants a s , fx s , and X s , n are chosen so that the coeffi- 
cients of all the independent derivatives of G vanish. This will occur if, and only 
if, the constants satisfy the following algebraic equations: 


ai + «2 + ct 3 = 1 


fx 2 oc 2 + jx 3 ot 3 = - 


Xi,i« 2 + (A.2.1 + X 2 ^)a3 — 2 A*.|a 2 -f n\a 3 — — 


P'2Xi,ia2 + ^3 (X 2 ,i + X 2j2 )a!3 — ^ 


Xf 1 CC 2 + (X 2 ,i + X 2) 2) 2 a3- g 


/x 2 X 2 , 20 : 3 — ^ 


Xi, iX 2 ^ 0 : 3 — ^ 


These equations imply that 

X 14 = jx 2 


X 2 ,l+ X 2,2 — fx 3 


and there are four independent equations which must be satisfied by the re- 
maining six unknown constants. Hence, two of these constants can be chosen 
arbitrarily. 

The most frequently used Runge-Kutta method is of the fourth order. 
The values of the constants a s , fXs, and X S) „ for the general vector equations 
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Table 11— 1. — Choice of Parameters for 
Fourth-Order Runge-Kutta Method 


Parameters 

Standard 

Runge-Kutta 

method 

Kutta’s method 

«i 

1/6 

1/8 

Ot 2 

1/3 

3/8 

<*3 

1/3 

3/8 

a 4 

1/6 

1/8 

M2 

1/2 

1/3 

M3 

1/2 

2/3 

^4 

1 

1 

X 14 

1/2 

1/3 

X 2 ,1 

0 

-1/3 

^2,2 

1/2 

1 

^3,1 

0 

1 


0 

-1 

^3,3 

1 

1 


(11-17) to (11-19) for the fourth-order Runge-Kutta method are listed in table 
11-1 for two choices of the arbitrary constants, and because of its importance 
the complete formulas for the standard Runge-Kutta method are also listed: 

yj+i = yj+ (kj + 2k 2 + 2k3 + k4) 


ki = G(xj, yj) 

k 2 = G ^ hj, y j + | hjki 'j 

k 3 = G ^ xj+ | hj, yj+ | hjk 2 ) 

k4 = G(xj+ hj, yj + hjks ) 


There appears to be only a slight advantage which can be gained by changing 
the choice of the arbitrary parameters. 
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Although the Runge-Kutta method involves fairly simple formulas, it 
has certain disadvantages. Thus, (1) the method is limited to the fourth or 
fifth order; (2) if the function G is complicated, the evaluation of the k* for 
s= 1, 2, 3, 4 at each mesh point can be quite time consuming; (3) it will calcu- 
late a solution even at points of discontinuity without giving any indication that 
this has been done; and (4) there is no readily obtainable error analysis. 

The lack of any error analysis for the fourth-order Runge-Kutta method 
can be partially compensated for by using certain rules of thumb. Thus, for 
example (see ref. 41), if the quantity 

k 2 — k3 
kj — k 2 

becomes much larger than a few hundredths at any point xj, the step size 
hj should be decreased. 

11.2.1.2 Multistep processes: Finite differences. — Up to this point we have 
been discussing one-step difference equations, that is equations which deter- 
mine the value of the dependent variable at the step Xj+i completely in terms of 
its value at the preceding step xj. There are other types of difference equations 
which can be used, called n-step equations, 112 which utilize the values of the 
dependent variable at the first n preceding steps, say xj, xj-i, . . ., xj- n +t, 
to determine its value at the step Xj+ 1 . 

Before discussing these difference equations it is first convenient to in- 
troduce the concept of difference operator. Thus, the difference operators 
A, V, and 8 corresponding, respectively, to the forward difference, the backward 
difference, and the central difference are defined by 

A y(x) = y (x + h) -y(x) 

Vy(x) = y(x) —y ( x — h ) 

8y ( X ) =y(x+-j)-y 

If the function y{x) is defined only on a finite set of points, say x\ < X 2 < . . - 
< x m +i withxj+i — *j= h for j=l,2, . . ., m, we shall sometimes write 



112 The associated numerical procedure is referred to as an n-step process. 


334 


NUMERICAL METHODS 


yj = y(xj) = y (xi + O'- 1) h) 

Then the notation for the first two difference operators becomes 

kyj—yj+i—yj v yj=yj-yj-i 

These operators arise in approximating the derivative of functions. The 
nature of this approximation can be seen from the relations 


a— o n *-» h a- o Ah dx 


Applying these operators twice in succession gives the second differences 
A 2 y{x) = A [Ay(*)] = A [y (x + h) — y(x)] =y (x + 2h) —2y (x+ h) -t-y(Ac) 


V 2 y(x) =y{x) —2 y (x — h) + h (x — 2 h) 


8 2 y(x ) =y (x+ h) — 2y (x) + y (x — h) 

These formulas can be used to provide approximations to the second deriva- 
tive since 


d 2 y(x) 

dx 2 


A 2 y(x) 
urn — tz — 
*-« h 2 


lim 

h~* 0 


V 2 y(jc) 

h 2 


lim 

A — >0 - 


8 2 y(x) 

h 2 


There are various manipulations that can be performed with these and other 
difference operators which are sometimes useful for obtaining finite difference 
equations from differential equations. A fairly detailed discussion of this is 
given in Hildebrand (ref. 44). 

Now consider the general nth-order differential equation in normal form 


l-r( dl 
" G Y*’ y ’ dx ’ 


d n y_ 
dx 


d n ~ 1 y\ 

’ dx»~' ) 


a ^ x^S b 


(11-33) 


335 



DIFFERENTIAL EQUATIONS 


Suppose that the interval a^x^b has a partition Xi, x 2 , . . x m +i for which 
all the subintervals Xj *£ x Xj + 1 have the same length h. Upon replacing 
each derivative ( d k y)l(dx k ) for A: = 1 , 2, . . n in equation (11-33) by its 
forward difference approximation h~ k A k yj, where jj is the approximation to 
y[a+ O'-l) h\ — V (%j) » we obtain the difference equation 

An y .= h n G (x h yj,±-Ay h . . . , ^ (11-34) 

However, since is a linear combination of yj, y J+ i, . . yj+k, this equation 
is essentially of the form 

yj +m =<f>{j, yj, yj+i, . . ., Yj+m-i’, h) 


which is clearly an n-step difference equation. 

In fact, even first-order normal differential equations can lead to n-step 
difference equations. Thus, consider the differential equation 


dy 

dx 


= G(x, y) 


and approximate the derivative by the central difference (yj+i yj-i)/2h to 
obtain the difference equation 


yj+i = yj-i + 2hG(xj, Yj) 


o 

which is clearly a two-step equation. 

Notice that, in order to start the solution of an n-step method, we must 
have a prior knowledge of the values of yi, yi , y$, . • Yn, that is, the values 
(or approximate values) of the solution at the first n mesh points. A one-step 
method requires only a knowledge of the initial value yi. Hence, when n > 1 
it is in many instances necessary to introduce some auxiliary method for deter- 
mining these values. The one-step methods are, therefore, said to be self- 
starting. In a one-step method, the mesh size hj = Xj+i — Xj can be varied at 
each step. With a multistep method, it must usually remain fixed. 


336 


NUMERICAL METHODS 


11.2.2 Implicit Methods: Predictor-Corrector 

In the methods discussed so far, the value of the dependent variable 
yj + 1 at the step Xj+i is determined explicitly in terms of its values at one or 
more preceding steps. We can, therefore, calculate the values of the dependent 
variable recursively. Hence, such methods are called explicit methods. There 
are other methods, however, in which the formulas for calculating yj+i from 
the values of the dependent variable at the preceding steps are not solved 
explicitly for yj+i but determine this variable only implicitly. Such methods 
are, therefore, called implicit methods. 

In order to see how difference equations of this type arise, let 

a = X i<X2<. . .<x m +i = b 

be a partition of the interval x^ b; suppose that each subinterval xj ^ x ^ 

f b 

Xj+i has the same length, say h; and consider the integral I f(x)dx. Recall 

that this integral can be evaluated numerically in an approximate fashion by 
using either the trapezoidal rule 

fb mi 

f(x)dx LA*j+i) +f(*j)]h 
Ja 2=\ ** 


or by using Simpson’s rule 

fb m-l 1 

f{x) dx « Y r If (xj+ 2 ) + 4/ Uj+ 1 ) +/ ( Xj ) ] h 

Ja j=1 o 

Now, consider the first-order differential equation 

y- = G(x, y) (11-35) 

dx 

Integrating both sides of this equation first between Xj and Xj+i and then between 
Xj and Xj +2 gives, respectively, 
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yj+i=yj+ 


f J+1 G(x, y(x))dx 
Jx i 


(11-36) 


and 


yj+2=yj + 


j+2 G(x, y(x))dx 


(11-37) 


Upon using the trapezoidal rule for evaluating the integral in equation (11-36) 
and Simpson’s rule for evaluating the integral in equation (11-37), we get the 
following finite difference equations for approximating the solution to equation 
(11-35): 


yj+i = yj+ [G(*j+i, y j+ i) + G(xj, yj)] - 


(11-38) 


and 


yj+2 = yj + [G(xj+ 2 , yj+t) + 4G(xt+u yj+i) + G(xj, yj )] | (11-39) 

Notice that in the first of these equations, y/+i appears not only explicitly 
but is involved implicitly through G on the right side. In general, it will not be 
possible to solve this equation to obtain an explicit formula for yj+i. Similar 
remarks, of course, apply to the second equation. If, instead of treating the 
single first-order normal equation (11-35), we consider the general normal 
system 


*-G(,.7) 

dx 

the same arguments would lead to the difference equations 

h 

yj+i = yj+ [G(*j+ 1 , y-j+i) + G(x/, yj)] - 


(11-40) 


(11-41) 
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and 


yj +2 = yj + [G(x j+ 2 , yj+ 2 ) + 4G(^j+i, yj+i) + G(*j, yj) ] | (11-42) 

Notice that, in equation (11-41), y J+ i is determined completely in terms of 
its value at the preceding step; whereas, in equation (11-42), yj +2 is deter- 
mined in terms of the values of the dependent variable at the preceding two 
steps. Since these formulas cannot usually be solved explicitly for the de- 
pendent variable, it is usually necessary to resort to an iteration process at 
each step to find this variable. Thus, let 


U (yj+i) = y,+ [G (x j+ 1 , 


yj+i)+G (x,-,yj)]y 


Then equation (11-41) can be written as 

y j+ i = U (y j+ i) (11-43) 

Suppose that by some means a fairly good guess at the solution yj+i of equation 
(11-41) can be made, say y^,. Substituting this into equation (11-43) gives 
a better approximation y^*), to the solution, given by yjj^ = U (yj^, ) . 

Proceeding in this manner, we obtain the sequence of approximations 

yiV. = U (yj®*,), ^, = U(y5«,), = U (?<») 

which we hope will converge fairly rapidly to the solution of equation (11-41). 
A good choice of the initial approximation can be obtained by using Euler’s 
formula, equation (11-5) , to obtain 


yj+i = yj + G Uj, yj) h ( 1 1-44) 

In practice, instead of solving equation (11-41) accurately for y J+ i by 
performing many iterations, we can obtain the same accuracy with much less 
work by taking a finer mesh size h and performing only one or two iterations. 
If only one iteration is performed, the method is called the improved Euler 
method. In this case, we first compute the vector Y j+1 , called the predictor , 
from the formula 
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Yj+i = yj + G (xj, yj) h ( 1 1-45 ) 

called the predictor formula, and then substitute it into the formula 

yj+ i = yj+ [G (x j+ u Yj +1 ) + G (Xj, yj)] y (11-46) 

called the corrector formula, to determine yj+i. Thus, in effect, yj+i is cal- 
culated from yj in two steps instead of one. Of course, we can apply the same 
procedure to equation (11-42). This leads to Milne's method. 

These two methods are examples of the predictor-corrector methods. 
Other predictor-corrector methods differ from these only with respect to the 
polynomial interpolation formulas from which the predictor and corrector 
formulas are derived. A commonly used predictor formula is the Adams- 
Bashforth formula (see ref. 45) 

Yj+i = yj 4- |j- (55Gj - 59Gj_i + 37Gj _ 2 - 9Gj_ 3 ) 

where we have put Gj = G(.*j, yj). This formula is most frequently used in 
conjunction with the A dams-Moulton corrector formula (see ref. 45 ) 

yj+i = yj + 24 PG(*j+i> Yj+i) + 19Gj — 5Gj_i + Gj_ 2 ] 

These formulas have a higher order of accuracy than the Euler’s formulas. How- 
ever, they are not self-starting. Also, unlike the Runge-Kutta method they 
cannot be easily used alone with a variable mesh size. These difficulties are 
frequently alleviated in practice by using the Runge-Kutta method to obtain 
the starting values and also to compute the solution for the first few mesh 
points after the step size has been changed. However, the predictor-corrector 
methods can, in the case of complicated equations, result in a considerable 
savings in computer time over the Runge-Kutta method. In addition, it is 
usually possible with predictor-corrector methods to monitor the error as the 
calculation proceeds. 

Another difficulty with the predictor-corrector methods is that in some 
cases, they are subject to certain types of instabilities which do not occur 
when the Runge-Kutta method is used. This instability manifests itself first by 
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resulting in an error which is larger than expected; and when one attempts to 
reduce this error by decreasing the step size, the error actually increases. A 
more detailed but elementary discussion of this instability is given in reference 
45. 

11.3 BOUNDARY-VALUE PROBLEMS 

11 . 3.1 Linear Equations 

11 . 3 . 1.1 Use of superposition . — The methods discussed previously have 
all been methods for solving initial-value problems. However, in the case of 
linear equations, these methods can be used in conjunction with the super- 
position principle to obtain solutions to boundary-value problems. Thus, in 
order to solve a boundary-value problem for a linear differential equation, we 
need only solve numerically the same number of initial-value problems as 
the order of the differential equation, provided these problems are chosen in 
such a way that their solutions are linearly independent of one another. It is 
easily seen from section 1.6 that this can always be done by choosing the 
initial conditions of these problems so that they have a nonzero Wronskian 
determinant. Then any boundary-value problem can be solved by forming a 
linear combination of these solutions with the constants adjusted numerically 
to satisfy the imposed boundary conditions. This is a particular example of how 
some a priori knowledge of the properties of the solutions of the equations to 
be solved can be utilized to simplify the numerical procedure for obtaining 
these solutions. 

11.3.1.2 Finite differences . — Another method for solving linear boundary- 
value problems is the method of finite differences. In order to illustrate this 
method, let us consider the second-order linear equation 

3-7 + p(x) q(x)y= r(x) a «£ x 6 (11-47) 

dx 2 dx 

subject to the boundary conditions 


y( a ) =A 

y{b) =B 


(11-48) 
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Let a = x o< xi < . . . < x m +i = 6 be a partition of a x =£ b with equal 
mesh size h. 

Upon approximating the derivatives by appropriate central differences, 
we obtain the following difference-equation approximation to equation (11-47) 


yj + 1 - 2 yj + yj-i yj+i -yj - 1 


2h 


2 h 


Pj + yjQj = n for j = 1, 2, 


m 


where we have put pj = p(xj), qj = q(xj ) , and rj = r(*j). This can be written 
as 

(i - | Pi ) y,_i + (h 2 qj - 2) yj + (l + f Pj ) W+1 = for;=l,2,. . .,m 

Upon using the boundary conditions (11-48) to replace yo by /I and y m +i by 
fi, we obtain the following set of m equations in the m unknowns yi, j 2 , • • • , 
y m : 

(h 2 q l - 2) yj + ^1 + | pi ^ y 2 = /i 2 n + ( | Pi — l) ^ 

(l — |/> 2 ^yi+ (h 2 qi — 2) y 2 + (^1 + | p 2 J y3 = h 2 r 2 

^1 — | p m -ijym- 2 + (h 2 q m - 1 — 2) y m -i + ^1 + | Pm-i j y m.~ h 2 rm—\ 

^1 — | Pm j ym-i + (h 2 qi~2) y m = h 2 r m — ^1 + | Pmj B 

This equation can be written in matrix form as 

My = c (11-49) 

where y is the vector y= (yi, y 2 , . . y m ), M is a matrix of the form 
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ft 

yi 

O 

O 

. 0 

0 

0 

0 



a 2 

0 2 

72 0 . . . 

. 0 

0 

0 

0 



0 

«3 

03 73 • . • 

. 0 

0 

0 

0 


M = 

• 

• 

• 

• 

• 

• 

• 



0 

0 

0 0 . . . 

. 0 

OCm - 1 

0m- 1 

7 m - 1 



0 

0 

O 

O 

. 0 

0 

OCm 

0 m 


and c is a vector. 

The entries in M and 

c are 

all known since p, 

q, and r are 


known functions. For obvious reasons, the matrix M is said to be tridiagonal. 
Tridiagonal matrices can be numerically inverted easily and quickly; and, 
therefore, equation (11-49) can readily be solved by inverting M to find the 
solution vector y. A particularly convenient technique is known as the line 
inversion method. In setting up finite difference problems it is not always 
possible to obtain tridiagonal or even n-diagonal matrices. However, whenever 
possible, the difference equations should be set up to obtain n-diagonal matrices 
since they can usually be inverted more easily than other types of matrices. 
In addition, when the system is programed for a digital computer, it is not 
necessary to define all m 2 locations of the coefficient matrix M; in the case of 
a tridiagonal matrix, for example, only 3 m locations need be allocated to M 
while performing the inversion. In any case the matrices which arise in the 
finite difference methods may usually be inverted by either implicit or explicit 
means. For a discussion of the various methods for accomplishing this, the 
reader is referred to references 46 and 47. It sometimes happens, however, 
that it is not possible to invert these matrices; it is then necessary to use an 
iterative process to solve the matrix equation (see ref. 47). 


11.3.2 Nonlinear Equations 

11.3.2.1 Shooting methods. — Boundary-value problems for nonlinear 
equations can be solved by using “ shooting methods .” To use these methods 
the problem is first transformed to an initial-value problem by guessing enough 
additional initial conditions at one boundary to allow the integration to proceed 
across the interval to the other boundary. In the initial trial the specified 
boundary conditions at the second boundary are unlikely to be met. But, by 
adjusting the additional initial conditions imposed at the first boundary, it is 
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possible to come closer to satisfying the prescribed boundary conditions by 
carrying through a number of iterations. 

The required adjustments to the additional initial conditions can be 
made in a number of different ways. Perhaps the most common of these is 
linear interpolation. In order to illustrate the ideas involved, consider a two- 
point boundary-value problem for a second-order system on the interval 
a *£ x ^ b in which the boundary conditions y(a)=A and y(6) = B are 
specified. And suppose that the initial slope y (a) is to be adjusted until the 
second boundary condition is satisfied. When two values of the initial slope 
yi(a) and y 2 ' (a) have been found which lead to the two boundary values yi(b) 
and yz(b), respectively, such that 

yi(b) ^ B ^ y 2 (b) 

the next trial value of y' (a) is determined by linear interpolation by using the 
prescription 


y'(q)-yi(q) _ fl-yi(6) 
y'zia)— y'i(a) y 2 (b)-yi(b) 

If this new value of y' (a) leads to a value y(b) which is either smaller or larger 
than B , it can be used together with either y 2 (b) or yi{b), respectively, to 
repeat the process. The process can be continued until the second boundary 
condition is satisfied to within the desired accuracy. 

One difficulty with shooting methods which sometimes occurs is that the 
differential equation is so unstable that it “blows up” before the initial-value 
problem can be completely integrated. In such cases the process of quasi- 
linearization can be used. 

11.3.2.2 Quasi-linearization . — Nonlinear boundary-value problems can 
also be solved by reducing them to linear problems by a process of quasi- 
linearization. This process consists of replacing the original equation by a 
sequence of linear equations in such a way that the sequence of solutions to 
these equations converges to the solution of the original equation. Thus, 
consider the first-order normal equation 


£= c <^> 


(11-50) 


344 


NUMERICAL METHODS 


If y is close to y, we might anticipate from Taylor’s theorem that 

c)C 

G(x, y) ~ G(x, 9)+-^ ( x , y) (y-y) 

We, therefore, replace equation (11-50) by the sequence 
dv (n+1) 

fa-=G(x, y in) ) + J^ (x, y(»))[y(n+l)- y (»)] for n = 0, 1,2, . . . 

(11-51) 

of linear equations for y< n+1 >. To use these equations we, first, choose a reason- 
able guess for y 0) , calculate y* x) , insert it in the right side, and calculate y^ 2 \ 
Proceeding in this manner, we obtain a sequence of solutions y n) which we 
hope will converge to the solution y of the original problem. Since each equa- 
tion (11-51) is linear, it can be handled by the methods described previously. 
These ideas are easily generalized to the first-order normal system 

fx = G(x - y) 

and, therefore, to any normal system of differential equations. The basic 
reference on the quasi-linearization process is Bellman and Kalaba (ref. 48), 
in which are discussed various conditions which can be imposed on the func- 
tions G to ensure that the iterations converge. 

Although all the most commonly used methods are described in this 
chapter, it is impossible in a single chapter to touch upon all the available 
techniques. For a more complete treatment of the subject, the reader is referred 
to the references cited. 
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Absolute value of complex number, 113 
Adams’ method, 319 
Adams-Bashforth formula, 340 
Adams-Moulton formula, 340 
Adjoint equation, 99, 105 
Affine group, 93 
Airy’s equation, 288 
Analytic continuation 
along a simple curve, 134 
definition, 133 

in neighborhood of singular points, 137 
of multiple-valued functions, 139 
of solution of differential equation, 160 
specific method, 135 

Analytic functions of a complex variable, 114, 116 
Analytic functions of n complex variables, 146 
Associated homogeneous equation, 24 
Associated Legendre equation, 229 
Associated Legendre functions, 231 
Asymptotic expansion, 262, 265 
nonuniformly valid, 287 
of a solution, 267, 285 
Asymptotic power series, 265 
Asymptotic sequence of functions, 264 
Asymptotic series, defined, 263, 265 
Autonomous systems of differential equations, 59 

Bernoulli’s equation, 45 
Bessel’s equation, 241, 277, 288 
Bessel function, 278 
of first kind, 242 
of second kind, 244 
modified, of first and third kind, 246 
Beta function, 156 
Boundary conditions, 17, 319, 341 
Branch cut, 144 

Branch of analytic function, 145 
Branch point, 141 

Canonical basis of solutions, 168 
Cauchy estimates, 123 


Cauchy product of two series, 126 
Cauchy-Riemann equations, 114 

Characteristic curves of partial differential equation, 68 
Characteristic equation 
of partial differential equation, 68 
of two algebraic equations, 162 
Characteristic exponents, 177 
difference an integer, 195 
difference not an integer, 188 
Circle of convergence, 122 
Complementary function, 32 
Complete analytic function, 139 
Complex integration, 129 
Complex plane, 111, 116 
Complex variables, 111 

Confluence of singularities in Riemann-Papperitz 
equation, 233 

Confluent hypergeometric functions, 235 
Conformal mapping, 116 

Critical points of first-order autonomous systems, 64 

Degree of differential equation, 8 
Derivatives 

of functions of complex variables', 113 
of vectors, defined, 58 
Difference 

forward, backward, central, 334 
Difference equations, 320 
one-step, 322 
n-step, 334, 336 
Differential equation 
containing a parameter, 281 
definition, 7 
exact, second order, 93 
equivalent to linear equation, 101 
fundamental theorem, 13, 148, 151 
homogeneous, 8 
linear, 8 
normal, 13 
of first degree, 35 
of first order, 35 
partial, 68, 69 
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Differential equation in complex plane, 148 
analytic continuation of solution, 150 
Taylor series expansion of solution, 149 
Discretization error (see truncation error) 

Divergent series, as solution of differential equations, 256 
Domain, 2, 114, 121, 123 

Elliptic equation, 79 
Elliptic functions, 79 
Entire function, 121 
Equation of Painleve, 102 
Equation splitting, 109 

Equation with dependent variable missing, 74 
Equilibrium solution of first-order autonomous system, 64 
Equivalent equations under change of variable (second 
order), 105 

Essential singularity, 120 
ofp(z) andq(z),252 
Euler’s constant, 245 
equation, 83 

equation, homogeneous, 107 
formula, 112 

integral of second kind, 154 
method, 319, 324 
method improved, 339 

Factorial function, generalized, 155 

Finite -difference method, 341 

Finite points of the complex plane, 117, 121 

First integral, 12 

Fuchs theorem, 149 

Fuchsian equation, 201 

with two singular points, 202 
with three singular points, 203 
Fuchsian invariant for equation of order two, 206 
Function 
implicit, 5 
single valued, 3 

Functionally independent integrals of a system, 55 
Fundamental set of equations, 151, 159, 168 
Fundamental solutions, 33 
Fundamental theorem 

of analytic continuation, 132, 141 
of first-order normal system, 50 

Gamma function, 154 
Gegenbauer polynomial, 227 
Geometric series, 125 

Group theory method for reducing order of equation, 91 

Hankel functions of first and second kind, 246 
Hermite polynomial, 249 

Holomorphic functions of a complex variable, 114 


Homogeneous equation 
first order, 43 
of order n, 8 
second order, 79 

Homogeneous Euler equation, 107 
Homogeneous function of degree k, 45 
Homogeneous solution, 24 
Hypergeometric equation 210 
derivative of, 223 
integral form, 222 
Rummer’s 24 solutions, 220 
logarithmic solution, 225 
of Gauss, 212 
Whittaker’s form, 238 
Hypergeometric functions, 215 
generalized, 224 

Implicit function theorem, 5 
Indicial equation, 177, 199 

of Fuchsian equation with two singular points, 203 
of Fuchsian equation with three singular points, 205 
Infinitesimal operator, 87 
Infinitesimal transformations, 87 
Initial conditions, 13, 319 

Inner and outer expansions of solution near a transition 
point, 297 
Instability 

in predictor-corrector methods, 340 
induced, 322 
inherent, 321 
Integral 
first, 95 

of differential equations, 12 
of system of equations, 51 

Integral curves of first-order autonomous systems, 64 
Integrating factor 

of first-order equation, 41 
of second-order equation, 97 

Interchange of dependent and independent variables, 107 
Invariant of second-order equation, 105 
Irregular singular points, 173, 251 
Isobaric equation, 43, 81, 93 
Isobaric function, 82 
Isolated singular points 
of analytic functions, 119 
of second-order linear equation, 159 

Jacobi differential equation, 225 
Jacobi polynomial, 226 
Jacobian determinant, 52 

Rummer’s confluent hypergeometric equation, 235 
first formula, 240 
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second formula, 243 

24 solutions to hypergeometric equation, 220 

Laguerre polynomials, 240 
Lame equation, 249 
Laurent series, 128 
Legendre 
equation, 228 
function of first kind, 228 
function of second kind, 228 
polynomial, 228 
transform, 108 
Linear equations 
differential, 24 
first order, 35 

Linear fractional transformation, 117, 209 

Linear homogeneous equation, 106 

Linear independence, 25 

Linear second-order equations, 74 

Linearly independent solutions in complex plane, 151 

Liouville’s theorem, 121 

Logarithmic function of complex variable, 145 

Magnification group, 88 
Matched asymptotic expansions, 295 
Mathieu equation, 249 
Meromorphic function, 121 
Method of variation of parameters, 84 
Milne’s method, 340 
Modulus of complex number, 113 
Monodromy theorem, 141 (footnote) 

Multiple-time methods, 311 
Multiple- valued function, 139 

re-dimensional vectors, 57 

Neighborhood, 2 

Nonuniform-distortion group, 93 

Normal differential equation, 12 

Normal form of second-order equation, 105 

Normal solutions of irregular singular points, 269 

Normal systems of equations, 49 

One-dimensional equation, 43 
One-step processes, 322 
Order 

defined, 263 

of a differential equation, 8 
symbols, 263 
Ordinary point, 159 

Parabolic cylinder functions, 248 
Parameter, expansions of solutions in, 282 
Partition of an interval, 320 
p-discriminant, 23 
Pockhammer-Barnes function, 236 


Point at infinity, 117, 173 
Polar representation of complex number, 112 
Pole of an analytic function, 119 
Polynomial, 121 

Power series in complex variable, 122 
Predictor-corrector methods, 337 
Primitive, 9 

Principal branch of the logarithm, 145 

Principle of permanence of functional relations, 146 

Quasi-linear partial differential equation, 69 
Quasi-linearization of nonlinear equations, 344 

Radius of convergence, 122, 160 
Rank of an equation, 269 
Rational function, 121 
Recurrence relation, 177, 188, 190, 196 
Regular singular point, 172 
Regular solutions, 170 
Removable singularity, 119 
Riccati equation, 46, 80 
Riemann-Papperitz equation, 205 
Riemann P-symbol, 206 

Rodrigues formula for Jacobi polynomials, 226 
Round-off error, 321 
Runge-Kutta method, 328, 332 

Second-order equations, 73 

with dependent variable missing, 74 
with independent variable missing, 77 
Secular terms of a solution, 307 
Separable equation (first order), 43 
Shooting methods, 343 
Simple pole, 119 
Simply connected domain, 38 
Single-parameter Lie group, 86 
Singular locus, 23 
Singular point, 25 
of a differential equation, 35, 159 
Singular solutions, 19, 21 
Singularity of solutions, 169 
Solution 
curves, 64 

of differential equation, 8 
of equation, 5 
general, 10 
particular, 8 
singular, 11, 19 

Solution of second-order linear equation 
at ordinary point, 174 
at regular singular point, 174 
by change of variables, 104 
near irregular singular point, 251 
near isolated singular point, 160 
Stokes phenomenon, 266, 277 
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Strained coordinates, method of, 306 
Subnormal solutions of irregular singular points, 274 
Superposition principle, 24 
Systems of equations, normal, 49 

Tables of differential equations and solutions, 110 
Taylor series, 123 
Taylor series method, 322 

Transformation to equation with constant coefficients, 106 
Transition point, 287 
Translation group, 92 
Truncation error, 320 
local, 322 


Tschebychev equation, 228 
Tschebychev polynomial, 228 

Ultraspherical equation, 227 
Ultraspherical polynomial, 227 (footnote) 

Vector functions, 58 

Weber equation, 248 
Weber function, 248 

Whittaker’s confluent hypergeometric equation, 238 
Wronskian, 103, 152, 170 
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